
Photo 1: A Teletype Model 15 RO printer. This “receive only fp model is shown with cover removed (above) and with the cover 
in place . 
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Table 1: A listing of the 
various Baudot key code 
assignments. The weather 
code column, taken from 
the Reference Data for Ra- 
dio Engineers, fifth edi- 
tion, pages 35 to 38, table 
22, includes wind direc- 
tion and cloud cover 
symbols, in the communi- 
cations column, the fetters 
"WRU’’ stand for “Who 
are you?’’ 


Figures 


to Baudot Machines: 

Part 1, Description of Available Devices 


explain earlier mysteries. Psychologi- 
cally this is most comforting, because 
it wipes away the frustations that 
Invariably build up when neither you 
nor the computer appear to under- 
stand each other. 

EDN tears off the 6 foot or so strip 
of Teletype paper that results from the 
typical bout with the computer, folds 
it accordlonstyle, and files it. To make 
our reference complete we start each 
session by entering the day’s date and 
the session’s objective, and end it with 
a summary of what was, or wasn’t, 
accomplished. 

The compact hardcopy printout 
that you get is a much more useful 
debugging too I than the transient 


The hardcopy record provided by 
the Teletype printer proved unex- 
pectedly useful. Sample programs and 
their execution results ... could be 
reviewed many days later, often 
clearing up today’s cause of confu- 
sion.... 

Hardcopy records are especially 
help fuf during this learning process 
because when one attacks some new 
feature of programming a particular 
machine, he or she invariably begins 
by probing and experimenting, making 
many false tries. At that Instant, all 
inconclusive attempts seem like a 
complete waste of time. But later, as 
you advance, copies of your first 
attempts enable you to go back and 


Photos. 1-6 courtesy 
Clum. Photo 6 courts 
art W Cook. 




Speed Up the pP Design Process 
EDN, April 20 1 976, pages 63-72.] 


displays of console lamps, or the 
temporary frames of similar video 
readouts. You can study this hard- 
copy at leisure, make notes on it, and 
then file it away for future reference. 

All these practical virtues of per- 
forming program development through 
typewriter-like terminals have been 
known to computer types for two 
decades. These old timers are amazed 
to learn that it is taking the newcom- 
ers to the computer world so long to 
catch up. 

[“How Development Systems Can 


What Kinds of Baudot Machines are 
Available? 

Perhaps the best way to answer the 
question of what’s around is to catalogue 
each machine briefly, starting with the Tele- 
type Corp models and following with 'units 
by other manufacturers such as Klein- 
schmidt, Creed, etc. In these descriptions, 
I’ve included photographs of many, but 
hardly all, of the typical machines. Most of 
these units will be found with nameplates 
and number plates identifying the model and 
manufacturer (if you purchase a whole 
machine). ' , 


Model 11 

The first machine introduced with the 
name Teletype, in 1921. Used a type wheel 
and printed on paper tape at 40 words per 
minute. 


Beware: Model 15s May be 
Hazardous to Your Health! 

If you are considering a Teletype Model 
15 or similar surplus page printer, be fore- 
warned: Not only is the Model 15 obsolete 
(Teletype Corp no longer manufactures 
parts), but: 

• It's a rude, ugly mechanical mon- 
strosity next to LED readouts, shiny 
microprocessor cabinetry, and solid- 
state keyboards. 

• It’s a veritable assault of clanking, 
grinding, and roaring among the silent 
flash of panel lights, video displays, 
and the barely audible hum of power 
supply transformers. 

• Foul smelling oil and burning grease 
pour out of its bowels to- drown the 

i dean, woodsy smell of new program- 
ming books and operating manuals. 

; • It crawls along at a yawning, hair 
pulling, nail biting 60 to 100 words 
per minute, while a blazing processor 
hurries along wasting thousands of 
cycles between character outputs. 

But a Model 15 hardcopy page printer will 
cost between $35 and $100 depending on 
your resourcefulness, and luck. 


Teletype Model 12 

Introduced in 1922, about 12,000 of 
these units were sold. Practically all Model 
12s had been removed from service by the 
1940’s, which makes it a real antique. The 
unit consisted of a Baudot page printer and a 
keyboard, and sounded like a cement mixer. 
The electrical noise or hash generated' by 
these machines would require extensive 
shielding and cables for use with a micropro- 
cessor. Give it to a museum if you find 
one 


Teletype Model 14 

This series of Baudot paper tape readers 
and punches were manufactured beginning 
in 1925, with about 57,000 Model 14s of all 
types being sold. The Model 14 series con- 

• Model 14TD, or "transmitter distrib- 

utor” reads the paper tape and trans- 
mits serial data on a current loop. 
Speeds of 60, 66, 75 or TOO words per 
minute can be selected by installing 
the appropriate set of gears. Easily 
modified for parallel output. Price 
range from $20 to $150. ? V 

• Model 14 RO “Reperforator" or re- 
ceiver, punches paper tape from serial 
data received from at current loop. Tit 
1940 a typing reperforator was intro- Cj 
duced which also prints on the tape,.': 
but the printing is several characters ; - ■ 
behind the punches. Speeds are 60, 66, 
and 75 words per minute. Price range s ;| 
from $10 to $50. 


Table- 2: Physical characteristics of the four common teleprinter speeds for 
Baudot machines. In many cases, the same machine can be run at several 
different speeds simply by changing gears. ? > i. 


Stop 

PuIn Width 

31 ms 
30 ms , 
25 ms 
19 ms 


45 bps 
50 bps 
57 bpt 
75 bp* 
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Teletype Model 20 

This model was used by the wire services 
andwas similar jo the Model 15 except it 
used a six lever code for an expanded 
character set This is a rare machine and I 
have not seen it advertised. 


teletype Model 26 

Another rare, light duty machine used by 
the wire services. 


-■--■•yy 


Photo 2: The author’s 
Model 28 KSR Teletype 
machine. Note the much 
more compact mechanism 
(above) of this model 
compared to a similar view 
of the previous genera- 
tion's Model 15 in photo 
1. The Model 28 KSR with 
cover in place is shown 
below. 


• Model 14 KSR: A Model 14 RO with 
a keyboard. Enables punching of paper 
tape off line. I’ve seen prices range 
from $15 to $75. 




Teletype Model 15 


Teletype Model 19 


This machine comes in two versions: a 
Model 15 KSR with page printer and key- 
board and a Model 15 RO with page printer 
only (see photo 1). About 237,000 Model 
15s were produced from 1930 to 1957, with 
Bell Systems receiving about 100,000 of 
these, the US government about 60,000, 
and Western Union, the wire services, news- 
papers, railroads, airlines and RCA Corp 
making up the balance. The page printer is 
usually set up for 60 words per minute, but 
occasionally 66 and 75 words per minute 
machines are encountered. The typical used 
price ranges from $30 to $200. 


A set of Baudot machines which includes 
a Model 15 page printer and keyboard, a 
paper tape perforator, a Model 14TD reader, 
and a heavy duty loop supply. On some 
Model 19s, the punch is mechanically linked 
to the keyboard. Thus, tape must be 
punched locally and cannot be punched via 
computer control. The punch magnet re 
quires a 100 VDC source capable of sup- 
plying one amp. The tape reader is usually 
connected in series with the keyboard so 
that serial Baudot signals can be generated 
from either unit. The surplus price ranges 
from $125 to $250. 




& 
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is from 6 to 12 months. None are in stock 
and each machine is built to order. Used 
prices range from $200 to $1 500. 


Photo 4: A view of a 
Kleinschmidt Model J50 
KSR. This variant of the 
basic Teletype design 
(note its similarity to the 
Model 15) contains a cur- 
rent loop power supply. 


Teletype Model 28 

A general number for a series of Baudot 
machines, which includes ROs, KSRs, ASRs, 
TDs and reperforators. These are heavy duty 
machines like the Model 15, but generally 
operate at 100 words per minute. This is the 
most prevalent Teletype model used by the 
MARS people. Since production began in 
1952, almost 300,000 Model 28s have been 
sold. This machine is still being produced so 
new replacement parts are plentiful. Photo 
2 shows a Model 28 KS R. 

There is an actual advantage of owning a 
used Model 28 as opposed to the popular 
Model 33 ASCII machines. This is because 
the Model 28 is a heavy duty machine made 
for 24 hours per day operation, year after 
year. However, the Model 33 is a light duly 
machine, which means it was designed .for 
limited use: 3 to 4 hours per day, average. lt 
will work for reasonably long periods of 
time- but wiB~-te.quire maintenance and 
adjustment more often than the Model 28 or 
any of the other heavy duty machines such 
as the Models 14,15 and 19. 

New prices from the Teletype Corp at last 
report were: Model 28 RO, $1750 to $2000; 
28 KSR, $2000 to $2250; 28 ASR, $3300 
to $3900; prices vary with options; delivery 


Teletype Model 29 

Another rare machine,: essentially a spe- 
cial Model 28 built for IBM. 


Teletype Model 31 

A rare teleprinter and keyboard combina- 
tion, designed to print on 0.25 inch (0.64 
cm) paper tape. This was a small piece of 
military aircraft equipment (nomenclature 
TT-30/AGA-1) which was designed shortly 
after World War'll. It measured 9 by 14 by 
12 inches (23 cm by 36 cm by 31 cm) and 
weighed 23 pounds (10.4 kg),' Althou^i 
assigned the Teletype number 31, there is 
little indication that there was ever any 
civilian production. 


Teletype Model 32 

Looks just, _ like the ASCII Model 33 
except it has a three row Baudot keyboard. 
This is a light duty, 100 words per minute 
Baudot machine still in commercial -use. 
Available as RO, KSR and ASR, Used price 
£ Continued on page 154 
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Continued from page 17 


Photo 5: A view with 
cover lifted of the author’s 
Kieinschmidt Model 
TT-76A (military nomen- 
clature). This machine, is a 
paper tape punch and 
reader only, which 
actually types characters 
on the tape. Like the 
Kieinschmidt in photo 4, 
this unit contains a loop 
power supply. 


ranges from $100 to $1200, typically $400. 
Teletype Model 33 

This light duty eight level ASCII machine 
(see photo 3) is extremely popular in mini- 
computer circles, and could be called “the 
standard Teletype.” It is widely sold with 
new computer systems. More than 600,000 
Model 33s have been sold. New Teletype 
Corp prices are: 33 RO, $649: 33 KSR 


Teletype Model 40 ^ v (1 

The latest ASCII Teletype Corp machini 
includes a line printer, keyboard, CRT di: 
play, etc. The high speed line printer witl 
this set is an 80 or 132 column, 300 lines pe 
minute, heavy duty unit. Prices in the $1401 
to $2000 range. 


Teletype Model 35 

A heavy duty version of the light duty 
ASCII Model 33 ASR. New price is $3800; 

Teletyp^Model 37 

A heavy duty, 150 words per minute 


Kieinschmidt 

Kieinschmidt- machines 









Photo 6: Two views of the 
Creed 75 unit, in (a), the 
unit is shown with the 
cover in place as it might 
be seen in use. In the 
second view (b) the cover 
has been removed, and the 
paper tape supply drawer 
opened for inspection. 


teletypewriters for Spiegel in Chicago in the 
early 1960s, which were eventually passed 
on to a company named Wilcox Enterprises 
in Naperville I L. ■ -|S 

The basic unit shown in photo & 
consists of a page printer, keyboard and 
paper tape punch, but no paper tape reader. 
This machine uses a Baudot code at 10 
characters per second. Some differences 
from American machines are that the keyJ 
board a fid printer are mechanically linked; 
ie: no loop supply is needed for locafj 
typewriter like operation. The printer is run 
by activating a 24 VDC solenoid for space 
and another one for mark; thus there is.nd 
current loop. • ^ 

The keyboard output consists of five 
parallel bits and one strobe bit to indicate 
character ready, although it can be rewired 
for serial operation. ’ . . j 

Also available from Wilcox Enterprises 
are interface kits, programs, and erasable, 
programmable, read only memories which 
adapt the machine for use with 8008 and 
8080 systems. J 


and punch sets (TT-179) or a set designated 
the TT-100, which includes a page printer, 
keyboard, table'and power supply. These 
units are fairly common although not in as 
plentiful supply as their Teletype counter- 
parts. As with Teletype Corp, many other 
Kleinschmidt configuations were manu- 
factured for the military under various 
nomenclatures such as TT-65, TT-98, 
TT-271, etc. For example, a Model TT-24 is 
ai 100 word. per minute KSR machine. Photo 
4 shows a Kleinschmidt Model 150 KSR, 
which is typical of this line. My personal 
typing paper tape reader and punch set. is 
labeled “TT-76A/GGC” and is shown in 
photo 5. Price range, $20 to $150. 


Lorenz 

A German company which, like Klein- 
schmidt, has manufactured Model 15 KSRs 
and 15 ASRs. These machines are relatively 
rare in the US. 


Mite 

This New Haven CT company has manu- 
factured small Baudot page printer and 
keyboard machines for use in submarines 
and by the US Marine Corps. This device is 
relatively rare on the surplus market 


What Characteristics Should 
I Look For? ' 

Once you’ve Tound a Baudot machine anc 
are.' thinking about whether or not to buy it 
two items are of key importance beside: 
general mechanical condition; these art 
machine speed and character types. r 
Basically these units were supplied . witt 
three different kinds of character sets: com 

• T- vt ■ - J : 


Creed Model 75 Teletypewriter 

ITT Creed is the English counterpart of 
Teletype Corp. This company manufactured 
several hundred special Creed Model 75 

t": t56 









mercial, weather and stock market. Table 1 
lists the characters which correspond to 
particular 5 bit Baudot inputs. Note that in 
all three cases, the letters, lower case, are the 
same, but the figures, upper case, are dif- 
ferent, in fact some of them are downright 
strange! 

Obviously the most desirable character 
set for microprocessor use is the commercial 
or communications group, but don’t let a 
stock market or weirdo weather font stop 
you! Replacement commercial keys are 
available for $0.50 to $1 each from most of 
the sources listed. 

There are four printing speeds associated 
with these old machines, those being 60, 66, 
75 and 100 words per minute. Table 2 
summarizes the data rates, bit pulse widths, 
character speeds, etc, for each of the four 
speeds. More about how to increase machine 
speeds later.* 


Among the companies which advertise tele- 
printers in the various amateur radio magazines, 
computer journals and other "surplus" advertising 
spots are the following: 

Goodman, 5454 S Shore Dr, Chicago I L 6061 5 
(312) 752-1000. 

Teletypewriter Communications Specialists 
(Van's Electronics), 550 Springfield Av, 
Berkely Heights NJ 07922, (201 ) 464-5310. 

Andy Electronics Co Inc, 6319 Long Dr, 
Houston TX 77087, (713) 641-0576. 

Typetronics, POB 8873, Ft Lauderdale FL 
33310, (305) 583-1340. 

Lawrence R Pfleger, 532 W Wilson St #1, 
Madison Wl 53703, (608) 255-2935. 

Atlantic Surplus Sales, 3730 Nautilus Av, 
Brooklyn NY 11224, (212) 372-0349. 

Henry Holder, (817) 429-3021. 

Don C Dando, Rt 4, POB 454, Liberty MO 
64068. 

BVE Enterprises, POB 73, Paramus NJ 07652. 

Alltronics-Howard Co, POB 19, Boston MA 
02101, (617) 742-0048 

Wilcox Enterprises, 25W178 39th St, Naperville 
I L 60540, (312) 420-8601 

This list was culled from the publications circa, 
uly 1976. No inventory or prices are listed since 
s one supplier (Goodman) put it in a conversation, 

I have hundreds of 15s and 19s now, but they 
ametimes move slow, or sometimes fast, and I 
ould sell 'em all tomorrow.". In each case, where 
vailable, a phone number is listed so that the 


Next, Month: Baudot tele- ; 
printer information con- 
tinues with a discussion of , 
interfacing techniques and > 
problems. 
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Last month author Mike McNatt described a number of Baudot type 
teleprinters that are available on the surplus market today. Let’s assume that 
you have since rushed out in the intervening period and acquired your own 
Baudot teleprinter. To use it with your system, some method must be devised! 
to enable communication between the processor and the serial Baudot device, ■ 
Mike’s topic in this second part of a 3 part series. 


A Guide 


To use a Baudot page printer or keyboard 
with your computer system, some method 
must be devised to enable communications 
between the processor and the serial Baudot 
device. Traditionally,, these interface 
methods have been classified as either soft- 
ware or hardware. Both methods seem to 
have supporters who are radically convinced 
that the method they’ve chosen is the best. 
Strictly speaking, none of the Baudot ma- 
chine interface methods can be called purely 
software or purely hardware; they all lie 
somewhere in between. However, the hard- 
ware code converter obviously requires the 
least amount of new software generation. 


When using hardware conversion front 
ASCII to Baudot, a handshaking arrange- 
ment is used whereby the Baudot machine; 
tells the processor when it is ready to accept 
a character. Parallel ASCII data generated by 
the program’s output routine is then strobecf 
into the converter via a load pulse from the- 
processor. 

The converter’s ready line changes logic; 
levels, indicating that it is in a busy state.; 
During this state, the processor stands- by] 
while the converter 

1. generates a figures or letters shift 
character if required, 

2. converts the shift character into serial 

pulses which are sent to the Baudot 
machine, " 

, 3. converts the parallel ASCII character 
just loaded into an equivalent parallel 
. Baudot character via a programmable 
read only memory lookup table, and 

4. converts this new Baudot character 
into serial pulses wh ich are sent to the: 
Baudot machine. 


Michael S McNatt 
4658 E 57 St 
Tulsa OK 74135 


Hardware Interfaces, Code Converters 

The chief advantage of these converters is 
that they allow the use of Baudot machines 
with computer software which is written to 
input and output exclusively in ASCII code. 
This is a major factor when you consider 
that much of the microcomputer software 
available is written with ASCII 10. Figure 1 
illustrates the use of these converters, which 
consist of circuitry to (a) convert the parallel 
ASCII code from the processor data bus into 
serial Baudot code for use with page printers 
or paper tape perforators or (b) convert the 
serial Baudot output from a keyboard or 
tape reader into parallel ASCII code for 
presentation to the processor data bus. 


The converter’s ready line then changes 
back to its original state, indicating that it is, 
ready to accept another character. i' r .> 

A variation of this approach which may 
not require busy and ready handshaking is ; 
to use a long, serial shift register buffer. This 
would in effect compensate for speed dif- 



HIGH VOLTAGE 
LOOP SUPPLY 


Figure i: The block dia- 
gram describing the use of 
hardware code converters 
interfaced with a 
microprocessor. 



to Baudot Machines 


Part 2, Interfacing Techniques 


11a, 15, 16, and 21) and are available as 
printed circuit board assemblies from several 
of the companies listed in a separate box 
labelled “Sources of Baudot Hardware.” Ref- 
erences 9 and 10 present parallel to parallel 
code conversion circuits which require an 
additional UART or similar circuitry to 
obtain serial data for, or from, the Baudot 
machine. 

Software Interfaces 

In the purists’ software approach, one 
routine is used to construct the serial Baudot 
code for output, and another routine is used 
to decipher the incoming Baudot code for 
input to the operating program. Routines 
used to accomplish these functions are some- 
times referred to as timed software, since the 
bit timing of each character, 60 wpm, 100 
wpm, etc, is derived from software timing 
loops. For example, to derive the precise 
22.0 ms pulse periods required for 60 words 
per minute operation, a set of instructions 
which requires 12.7 pa to execute would be 
repeated 1,732 times for each bit of serial 
data. .. - 

The hardware requirement is held to an 
absolute minimum in both the input and 


ferences in an ASCII 110 bps serial output 
and the slower 45 to 75 bps rates of the 
Baudot machines. Since most interface mes- 
sages are less than one line of 88 characters 
in length, one line of shift register buffering 
may suffice; see reference 22, volume 2, #1, 
January 1976, page 9. ! 

When using hardware conversion from 
Baudot to ASCII, serial pulses from the 
Baudot machine are loaded into a shift 
register and presented as five parallel lines 
representing one character to another read 
only memory lookup table. Circuitry in the 
converter keeps track of whether the char- 
acter has been preceded by a letters or 
figures code, and presents this information 
to the read only memory as a sixth input 
line. When the parallel ASCII output from 
this memory is ready, the converter gen- 
erates a data-ready pulse, during which the 
processor loads in the ASCI I character. This 
conversion operates in a manner similar to 
that of a standard ASCII keyboard which 
generates a strobe or ready pulse after each 


keystroke, indicating that the character may 
be loaded into the processor. 

Circuits for accomplishing these conver- 
sions have been published, (references 11, 














output interfaces. The processor serial out- 
put interface can be a simple two transistor 
circuit, optoisolator or relay which opens 
and closes the teleprinter’s current loop. 


Several comments observed in the RTTY 
Journal (September 1973, page 13, for exam- 
ple) indicate that a Motorola MOC 1000 
optical coupler has been used successfully up 


GLOSSARY OF TELETYPE TERMS 


Mark or "Closed Loop": A condition of steady 
current flow in the teleprinter loop; see clutch. 


AFSK: Audio Frequency Shift Keying; see Fre- 
quency Shift Keyer. 


MARS: The Military Affiliate Radio System, 
whereby civilian amateur radio operators, hams, 
assist the armed forces with message communica- 
tions, at the same time acquiring valuable training 
in military communications procedures. MARS is 
operated jointly by the Navy, Air Force and Army. 

Paper Tapes: The narrow 0.375 in (0.953 cm) tape 
is usually gum backed and is used with a strip 
printer which just types messages on narrow tape. 
Western Union attaches strips of this tape to paper 
in their telegram operations. The wide 0.6875 inch 
(1.746 cm) tape is used with typing reperforators, 
chadless, and nontyping reperforators, chad type, 
which punch holes in the tape. 

Perforator: A unit which can punch tape from an 
attached Baudot keyboard input only because the 
perforator operates from mechanical linkages to 
this keyboard. Cannot punch tapes from an exter- 
nal current interrupting source. "f 


ASR: Automatic Send and Receive. Refers to a 
teleprinter system which includes a page printer, a 
keyboard, a paper tape punch and a paper tape 
reader. 


Chad, Chadless: A classification of paper tape 
punch machines, referring to whether the chad or 
small paper circles remain as a part of (chadless) or 
are completely removed from the punched tape. A 
quote: "The chadless tape does not wind well, 
takes up more space when wound and occasionally 
will not read correctly if the little tabs of paper get 
stuck back in the holes." (Dan Nolan, RTTY 
Journal , February 1976, page 17.) However, these 
potential problems may be insignificant if a parti- 
cular chadless machine is priced right. 

Clutch: A term used to denote that current is 
continuously flowing in the Teletype loop supply, 
uninterrupted by any data input. The clutch or 
marking state is used to prevent the machine from 
running open and producing nothing but mechan- 
ical clatter. 


Reperforator: Derived from receiving perforator. 
This unit can receive serial Baudot paperpunching 
messages from both the keyboard and an external 
current loop interrupter. . ' •) 


Converter, Terminal Unit (TU), or Demodulator: A 
radioteletype (RTTY) term which indicates a 
device that accepts the two audio tones, represen- 
ting mark and space, from the output of a receiver 
and converts them into current loop pulses for 
displaying characters on a page printer. 


RO: Read Only. Refers to a teleprinter system 
which includes a page printer only. 


RTTY : Radioteletype: A form of amateur radio 
communications which uses teleprinter machines 
for transmitting and receiving messages via radio 
waves. , 


CR and LF: Carriage Return and Line Feed. 


Demodulator: See Converter. 


Space or Open Loop: A condition of 
flow in the teleprinter loop. 


no current 


Frequency Shift Keyer, FSK: A radio Teletype 
term which indicates a device that converts the 
current loop pulses from a keyboard or paper tape 
reader into audio tones. An amateur radio trans- 
mitter then converts these tones to radio waves 
suitable for communications use. 


Stunt Box: A mechanical teleprinter attachment 
which performs certain nonprinting functions or 
stunts. These may include turning on a light, 
switching from nonprint to print, automatic car- 
riage return and line feed, vertical and horizontal 
tabulation, backspace, two color ribbon, etc. These 
are rather complicated. Rube Goldberg contrap- 
tions, and are difficult to service: 


Governor Motor: A nonsynchronous teleprinter 
motor, the speed of which is normally adjusted 
mechanically by viewing white strobe dots on the 
rotating assembly through a slit shutter tuning 
fork. Some hams have built speed controls for 
these machines, enabling electrical adjustment 

from 60 to over 100 words per minute. 

KSR: Keyboard Send and Receive, Refers to a 
teleprinter system which includes a page printer 
and a keyboard only. 


Synchronous Motor: ’ A 


motor which rotates 
synchronously at a constant speeds which is directly 
proportional to the AC line frequency. 


Terminal Unit,TU: See Converter. 


T ransmitter Distributor, TD, or Tape Reader: A 
machine which senses the characters punched into, 
a paper , tape and sends them automatically at a ’ 
constant speed over a teleprinter loop circuit. 

Typing Reperforator or Tape Punch: A machine 

whirh minrhot RnaiWnf rvnrlnrl nknWInM . 


Ham: A radio amateur. 


Loop Supply: A DC power supply used to drive 
teleprinter selector magnets; capable of supplying 
20. or more commonly 60 mA at 130 to 260 VDC. 
Below 130 V, loop time constants can cause 
difficulty. Western Union telegraph loops usually 
work with 260 V. 


which punches Baudot coded chadless holes into a 
paper tape and prints the associated characters 
right on the tape. 


WPM: Words Per Minute. 


, * ■ 
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to 130 VDC in this application. The serial 
input interface is an equally simple mini- 
mum component count circuit (see refer- 
ences 13 and 14). 

There are two ways to input and output 
the serial Baudot data with this software 
approach: 

1. Use operating software which inputs 
and outputs Baudot directly, as in the 
Baudot Monitor and Editor programs 
listed in reference 20. 

2. Use operating software which nor- 
mally uses ASCII input and output. 
However, provide additional code con- 
version lookup table software (see 


SOURCES OF BAUDOT SOFTWARE 

1. MITS, 2540 Alamo SE, Albuquer 
87106. Selected programs from tl 
Software Library: 

(a) =728752 by J R Scott, $2, 36 I 
translates some Baudot character 


SOURCES OF BAUDOT HARDWARE 

1. Processor Technology, 2465 Fourth St 
Berkeley CA 94710. 3P + S 10 Module, $125 
kit, $165 assembled, Altair/IMSAI plug com- 
patible. Includes serial 10 ports under software 
control, from 35 to 9600 bps for use with older 
Baudot machines. 

2. HAL Communications Corp, 807 E Green St, 
POB 365, Urbana IL 61801. MCEM-8080 
Microcomputer System, $375 assembled. Soft- 
ware monitor supports ASCII or Baudot code; 
optoisolator loop current driver and receiver. 

3. Mini Micro Mart, 1618 James St, Syracuse NY 
13203. Code Converter Boards: (1) Parallel 
ASCII to Serial Baudot kit #GPA-B, $34.95, 
(2) Serial Baudot to Parallel ASCII kit #TVT 
2540 - BSI, $34.95. 

4. MITS, 2450 Alamo SE, Albuquerque NM 
87106. The Altair 680 microcomputer $466 


byte program 
»r codes to 

ASCII equivalent. 

(b) =5-24-762 by Walter King, S2, 50 byte 
program loads object code from 5 level 
paper tape. 

(c) =5-24-765 by Walter King, $2, 71 byte 
program punches 5 level object tapes which 
can be read by #5-24-762. 

Id) =6-18-761 by James B Hansen, 215 byte 
program, title: ASCII to Baudot Translate 
Routine. 

(e) =6-22-761 by Jim Wiggins, (1) 106 byte 
program, title: Tape Load — Octal TLQ, (2) 
186 byte program, title: Tape Dump - 
Octal TDQ, for Baudot Teletypes. 

The prices shown assume membership, $10 per 
year, in the Altair User's Club. An interesting 
note is that MITS is accepting no more Baudot 
code converter or Baudot paper tape punch and 
load programs into their library because it is 
overstocked with these types of programs. 

2. M6800 User's Group Library, Motorola Inc 
Microsystems. 3102 N 56th St, Phoenix AZ 
85018. Reentrant ASCII to Baudot Converter 
Subroutine, Astrobaud. Other Baudot related 
programs may become available. A two year 
membership into this user's group is available 
for Si 00 or to anyone who submits an entry to 
the Program Library. 

3. Intel Corp, User's Library, Microcomputer 
Systems. 3065 Bowers Av, Santa Clara CA 
95051. Paper Tape Conversion program in 4040 
language, converts 5 level paper tape informa- 
tion to ASCII 8 level paper tape. Other 4040 
and 8080 Baudot code related programs may 
become available. A one year membership in 
the user's group is $100 or is available if a 


7 - Polymorphic Systems, 737 S Kellogg, Goleta 
CA 93017. A Serial IO Option, $75 kit. can be 
added to the Poly-88 microcomputer kit which 
provides a 50 bps minimum lOrate. 

8. Ohio Scientific Instruments, OSI, POB 374 
Hudson OH 44236. Model 41 2B Supercard is 
designated for Baudot Terminals, $129 kit, 
based on MOS Technology 6502 microproces- 
sor. 

9. See reference 17. 
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proach, whereby parallel Baudot characters 
are generated by the software. These charac- 
ters are presented to the data bus where 
busy and ready handshaking logic described 
in the above hardware interface description 
is used. In effect, the software performs the 
code conversion or parallel 10 function and 
the hardware performs the parallel to serial 
conversion and character bit timing 
functions. 

A UART or similar circuit (see reference 
17) can be used to implement these 
functions. The whole procedure operates in 
reverse when the serial Baudot characters are 
input to the hardware portion of the 
interface and presented in parallel form to 
the data bus. 

The Interrupt Driven Interface 

These old surplus Baudot machines may 


of course be interfaced to the processor if* 
using an interrupt scheme, with priorities % 
assigned to each Baudot device along with * 
other system peripherals. Interrupt schemes ( 
are beyond the scope of this discussion since Af 

there are so many variations of software and i 
hardware interrupt techniques, and they ‘ 4 . 
vary according to the processor being used. ■' 
Many, and possibly most, computer hobby- ■ 1 
ist systems do not use interrupts at all. 
Usually in this class of system, either there ■ 
are few peripherals being used or the oper- ■ 
ator has lots of time on his hands to WAIT ! 
on the peripherals, or both. One example 1 
would be a system with a keyboard and page A 
printer combination operating with monitor * 
and debug programs. 

For a system with several 10 devices and . ■/ 
very busy software, setting up an interrupt ^ 
scheme may become a matter of necessity. If v : ; 


REFERENCES ON BAUDOT TELETYPES 

Many of these references, especially the ama- 
teur radio magazines and books, may be available 
for loan from a friendly ham in your area. Some 
ham operators have been known to hoard all of the 
issues of some of their type journals, such as Ham 
Radio. CQ, 73, or QST magazines. It should be 
known that a similar affliction has been found with 
many BYTE readers . . . 

1. RTTY Journal, POB 837, Royal Oak Ml 
48068, 10 issues per year, $.35 per issue. 

2. RTTY from A to Z by Durward J Tucker, 
Corvan Publishing Corp, 1970, 214 pages, $5. 
Detailed information on operation, mechani- 
cal characteristics and maintenance of practi- 
cally all of the older Teletype machines. 

3. The Teleprinter Handbook, first edition by 
D J Goacher and J G Denny, Radio Society of 
Great Britain, 1973, $14.95. Distributed in 
US by Ham Radio Books, Greenville NH 
03048. An engineering textbook covering the 
theory and practice of radio teleprinter equip- 
ment; 30 pages of pictures and wiring dia- 
grams of Models 15, 19, and a few pages on 
the Model 28. Also covered are the design of 
power supplies, demodulators and Teletype 
test equipment. 


Shinsel, Vic: "RTTY Signal Generator," Ham 
Radio Magazine, March 1971, pages 23 to 29. 
RTL version of RY generator circuit. 


Dean, J A and Rupley, J P; RCA Application 
Note ICAN-6267, "Astable and Monostable 
Oscillators Using RCA COS/MOS Digital Inte- 
grated Circuits," 8 pages. 


The TIL Cookbook by Donald M Lancaster, 
Howard Sams Inc, pages 152 to 153. A 
parallel Baudot to parallel ASCII conversion 
circuit is presented which uses 5 ICs including 
2 PROMs. . 


Ellsworth, Cole: "ASCII to Baudot Con- 
verter," 73 Magazine, February 1976, pages 
52 to 57. A 15 1C parallel ASCII to parallel 
Baudot code converter. Additional UART 
circuit required to obtain serial Baudot for 
teleprinters. 


Hutton, Louis I: "Build This Exciting New 
TVT," 73 Magazine, March 1976, pages 76 to 
83. Included is a Teletype current loop 
Baudot code to parallel ASCII converter for 
driving a video terminal display, page 80. 


11a. Hutton, Louis I: "Code Converter Using 
PROMs," 73 Magazine, May 1976, pages 84 to 
86. Improved circuit using readily available 
PROMs; refers to reference 11. S \ 


4. Specialized Communications Techniques for 
the Radio Amateur, by the American Radio 
Relay League, 1975, 208 pages, $3. Chapter 
5: Radioteletype, 70 pages. A condensed 
overview of many of the Baudot machines, a 
table of Teletype Gear numbers for Models 
14, 15 and 28, and many other basics. 


12. Andreasen, C Warren: "A Versatile TTY 
Generator," 73 Magazine, April 1976, pages 
90 to 91. Circuit is given that counts from 
00000 to 11111 and converts to serial 
Baudot, thus generating a pattern of all 
Baudot letters and figures. 

13. Whipple, Dick, and Arnold, John: "AVery 
. Cheap 10 — the Model 15," 73 Magazine, May 

1976, pages 77 to 80. Presents both software 
, and hardware interfaces. 

... > 

14. King, Walter S: "Interfacing the 60 mA 
Current Loop," BYTE magazine, August 
1976, pages 96 to 97. 


The New RTTY Handbook by Byron H 
Kretzman, Cowan Publishing Corp, 1962, 191 
pages, $3.95. Includes a brief history of 
Baudot . teleprinters and some . equipment 


RTTY Handbook by Wayne Green, Tab 
Books, 1972, 320 pages, $6.95. Many basic 
amateur radio operating circuits such as con- 
verters, exciters and FSK circuits. 



this is the case, information on interrupt 
interfacing techniques and hardware may be 
obtained from a variety of references, 
including microprocessor manufacturers’ 
application literature or any of the excellent 
programming books or publications avail- 
able (references 23b and 26) which deal with 
this subject. 


1. Thoroughly clean the machine with 
hot water. An ideal place for this is at 
a self-service car wash facility. 

2. Force air dry the machine with an air 
gun from, for instance, one available at 
the auto service station with the car 
wash. 

3. Spray all parts with a light lubricant 
such as WD-40. 

4. Lubricate as required with oil and 
grease. Details of these procedures are 
contained in references 1 through 6. 
Pay particular attention to lubricating 
the felt separating pads, which should 
be saturated. 

One thing I might add is to clean all 
electrical contacts, especially the keyboard, 
after the above sequence. A standard pencil 
eraser rubbed over the contacts until clean is 
an excellent way to accomplish this. 


Useful Miscellaneous Items 


lesearcning ror tnis article, I ran across an 
employee of a national teleprinter refur- 
bishing house for Model 28s, 33s, and 35s, 
who offered some interesting suggestions for 
cleaning a Model 15. He said that this 
procedure could be followed if one obtained 
an extremely dirty teleprinter “fresh” out of 
a damp basement. The steps are as follows: 


17. Pasco, Richard C: “Converter Lets Processor 
Drive Teletypewriter," ELECTRONICS, 
October 30 1975, pages 97 to 98. Simple 6 1C 
TTL circuit which accepts 5 or 8 bit parallel 
data and converts it to serial current loop 
pulses at any desired bps rate. 


Reference Data for Radio Engineers, fifth 
edition, pages 35 to 40, table 24: "Moore 
ARQ Code (Compared with 5 Unit Tele- 
printer Code)" and pages 35 to 38, table 22: 
"Timing of Five Unit Start Stop Teleprinter 
Codes." 


Steve Stallings, software courtesy of 
Richard Smith. Contains listings of an 
ASCII to Baudot Translate routine, a 
Teletype Demo Program which generates 
lines of ripple pattern on the page printer, 
and a Baudot Teletype Output Routine 
which accepts Baudot characters ' and 
enables outputting in Baudot directly, if 
called as a subroutine, rather than con- 
verting from ASCII to Baudot. 

(b) Volume 1. #9, February 1976, pages 11 to 
13, "Interfacing the Altair 8800, Part 2" 
by Hal Chamberlin. Includes at excellent 
discussion of simple single device hardware 
and software interrupts, vectored and 
polled interrupts, and an example of a 
keyboard interface using the interrupt 
technique. 


19. Whipple. Dick, and Arnold, John: "Inex- 
pensive Paper Tape Subsystem: Using 5 Level 
Tape with Computers," 73 Magazine, July 
1976, pages 145 to 148. Includes listings of 
paper tape read and punch software for 
Teletype Models 14 and 19. 


20. Whipple, Dick, and Arnold, John: "A Baudot 
Monitor/Editor System . . . program listing for 
the 8080," 73 Magazine, August 1976, pages 
102 to 105. Uses the software IO described in 
reference 19. Also uses split octal ad- 
dresses and octal data notations. 


24. Teletype Corp, 5555 Touhy Av, Skokie IL 
60076. 


25. Nettles, Walter: "Instructions for Hand 
Operated Typewriter Ribbon Reinking 
Device," reference 1 .Volume 1 2, #6, page 16. 

26. How to Use the Nova Minicomputers, by Data 
General Corp, 1971, Section 2.4 "Program 
Interrupt." Contains a discussion of interrupt 
servicing and priority and programming sugges- 
tions such as when to use interrupt. 


22. The Computer Hobbyist Group ( North Texas ) 
Newsletter, 2377 Dalworth 157. Grand Prairie 
TX 75050. 

fa) Volume 2. #4. April 1976, pages 7 to 13. 
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Ribbon Renewing: When the black print 
ribbon begins to fade or dry up, additional 
ink may be applied on the top and bottom 
of the ribbon to provide many more miles of 
heavy duty use from a single roll of ribbon. 
Also, the printing is much darker and easier 
to read on both page printers and typing 
tape reperforators. Inks suitable for this 
purpose include stencil ink, stamp pad ink 
thinned with wood alcohol (methanol), and 
mimeograph or duplicator ink thinned with 
either naptha or turpentine; see reference 
25. One ham I spoke with bragged of using 
3-in-l household oil as a thinning agent. 
Regardless of which type of ink is used, 
apply new ink sparingly until you get the 
knack of reinking. Allow at least 24 hours 
for the ink to soak into the ribbon and 
partially dry out before using on a printing 
machine. 


ii is possioie to gear the old equip- 
ment up to turn 100 wpm, but partic- 
ularly the Model 15 and 19 page 
printers do not last long at 100 speed. 
The Model 14 typing reperforator will 
do a little better, but still hops around 
when forced to run at 100 speed. The' 
Model 14 transmitter distributor' ap- 
pears to serve adequately at 100 wpm, 
as does the Model 14 nontyping reper- 
forator, if kept well oiled. 

“The older machines will run at 67 or 
75 wpm with a gear change .. .” 

The gear changes referred to are relatively 
simple to achieve. Faster gear sets may be 
obtained from most of the suppliers of 
Baudot teleprinters. Also, a full page table of ' 
gear set numbers required for various speeds 1 
on Model 1 4s, 1 5s, and 28s is shown on page-" 
1 1 1 of reference 4." '"va 


Increasing the Operating Speed of Baudot 
Machines: There have been many comments 
in the hobbyist literature regarding the 
effects of increasing the speed of the older 
Baudot machines. Of the more popular units 
such as Models 14, 15, 19 and 28, only the 
Model 28 and some Model" 14s were actually 


Never Toggle in a Bootstrap 
Again 

Never Flip Sense Switches 
Again 

Run Without a Front Panel 
Plug Into Backplane 
Use With Your Own ROM 
Completely Configurable 


The Automatic Boot Loader 


POWER-START 


RMQ Systems new Altair 8800-compatible POWER-START™ board is supplied with complete documen- 
tation and configuration information. Components are of the highest commercial quality, including double- 
sided printed circuit board with solder mask on both sides and silk screened legend, low-prof 

1C sockets, and gold-plated bus contacts. 

' ... POWER-START™ Kit (PS-2K) 165.00 Kit (PS-1K) with ROM 195.00 

Assembled (PS-2A) 265.00 Assembled (PS-1 A) with ROM 295.00 

To Order or for More Information: 

RMQSystems 

Research &DevelopmentDivision of 

computer kits, inc. 

the systems store 

1044 University Ave., Berkeley, CA 94710 (415) 845-5300 
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96 INTERFACE AGE 


STICKS 4-7 

ASCII, as a 7-bit code, is usually represented in 8 col- 
umns of 16 positions. The row positions are 0000 through 
1111, the low-order 4 bits, 0 through 15 in decimal. Thu 
columns are 000 through 111, the next higher 3 bits, '6, 
through 7 in decimal. For some reason, the developers 
of ASCII found it convenient to refer to these eight col-, 
umns as “sticks.” So shall we. Each position will be rep : 
resented in this article by its usual decimal representa- 
tion. For example, capital A is position 4/1. Figure 2 is a 
representation of ASCII that is more convenient to those 
working in octal, rather than hexadecimal, notation. 


Figure 2. 


MAY 1978 


The data alphabet called ASCII (Figure 1, page 98, and 
Reference 1), also has two other names— International 
Standard 646 (the ISO Code [Reference 2]) and Alphabet 
No. 5 of CCITT (the International Consultative Committee 
for Telephone and Telegraph). It is used throughout the 
world, incorporated in billions of dollars of equipment. 

But is it used correctly and wisely? Not always. There 
are misinterpretations, and gaps in definition that per- 
mit nonstandard usage. This article (in three parts) will 
give you the background, peculiarities, preferred prac- 
tices. and new developments for ASCII. You will find a 
lot of information not too generally known or realized; it 
should help in the correct and safe usage of ASCII. For 
additional help, you can reference the various national 
and international standards given in Table 1. Some other 
detailed articles are listed in References 3, 4 and 5. 
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ISO - International Standards Organization 

ECMA - European Computer Manufacturers Association 

ANSI - American National Standards Institute 

FIPS - Federal Information Processing Standard 

CSA - Canadian Standards Association 

BS - British Standard 

AS - Australian Standard { 

CCITT - Consultative Committee International, Telephone & Telegraph 
JIS - Japanese Industrial Standard 
GOST - USSR Standard Table 1. 
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OCTAL 

DIGIT 


The first positions of sticks 4 and 6 are respectively 
‘commercial at" and “accent grave.” Then the upper 
lower case Roman alphabets follow. This offset of one 
position is historical (from the United Kingdom), and of 
no importance as long as you remember that it is so. 

Following the alphabet in both sticks 5 and 7 are three 
positions each that one must be very cautious about. In 
ASCII they are assigned as [, /', and ] in stick 5 — {, | J , 
and } in stick 7. But in the ISO Code and CCITT versions 
they are reserved for national usage. Table II gives the 
national use assignment for these positions. Surely you 
remember that the Scandinavian alphabet has 29 letters, 

• t 26? My friend Orjar Heen in Oslo is very protective of 
ese positions. He says “If you Americans want to sell 
computers and software abroad, don’t use the ASCII 
characters for these positions in your software.” 

To be more precise, positions 5/11, 5/12, 5/13, 7/11, 
7/12, and 7/13 (noted above) are called primary national 
usage positions.- So is 4/0, where ASCII has the “com- 
mercial at.” Honeywell, for example, uses the "at” in its 
timesharing systems for deleting the previous character 
upon entry. But this isn’t too serious, because many na- 
tions also have the "at" in their primary sets. 
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Table 2. 


Also in sticks 4-7 are three diacritical marks. They are 
accent grave (') in 6/0, circumflex (") in 5/14, and tilde (~) 
in 7/14. These are called secondary national usage posi- 
tions. In some countries the tilde is a straight overline. 

But it is the circumflex where we have a lot of confu- 
sion. Teletype first made it an “up arrow” in an earlier 
version of ASCII, to serve as an exponentiation symbol, 
primarily for BASIC. But that doesn’t do very well, be- 
cause the exponentiation for FORTRAN is a double 
asterisk! The FORTRAN version is preferable in France, 
certainly, because they use such words as crane, cote, 
cout, and so on. 

A companion problem exists in position 5/15, with the 
underscore. The underscore is neither national nor dia- 
critical; all countries use it just as underscore (and for 
typesetting it is a U.S. convention to indicate italics, but 
in Italy it means boldface, except when it is the last 
character in a line! 3 ). But Teletype’s early version of 
ASCII used it as a “left arrow” — probably for an assign- 
ment symbol equivalent to : = in ALGOL. The up and left 
arrow have been carried over from Teletype into many 
video terminals. Ask your terminal manufacturer to cease 
and desist and retrofit. It’s not ASCII and will only cause 
trouble forever. 

The last character in sticks 4-7 is the Delete, symbol 
DEL, in position 7/17. It was put here because the binary 
code is 1111111, which would be all punched holes in 
perforated (not always paper!) tape, and that is the only 
way to make sure that it cannot be misread as some 
other character. ASCII is a complete set; all positions 
are assigned to have meaning. 

STICKS 2-3 

These are usually called the sticks for digits and spe- 
cials. Remember that they are the “digits” 0 to 9; not 
numbers, not numerals, not anything but digits! They 
are in 3/0 through 3/9 so that the low-order 4 bits are the 
representations for packed decimal. Originally we con- 
sidered the possibility of a special 4-bit set for numeri- 
cal applications (see the fifth entry in Table la), but it 
turned out that computer hardware became inexpensive 
enough to not deprive ourselves of the extra capabilities 
of the 7-bit and 8-bit sets. 
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Position 2/0 is officially called “space." I don’t and 
didn't like it, and would have preferred “blank." Which is 
why the IBM community often uses a lower case “bee” 
with a slash through the vertical as its symbol. From the 
Univac side, the space has the official symbol “delta.” 

Having mentioned packed decimal, where two digits 
qo into each 8-bit group (“byte" to the American, “octet” 
the French), a word of caution on the plus and minus 
-- ; g n s — they are in stick 2, rather than stick 3 with the 
digits. But the low order 4 bits are distinct, and + 
should be used only as 1011, - only as 1101. 1 mention 
this because the nonstandard code EBCDIC permits 
multiple representations of + and - in packed decimal. 
And the ASCII representations are not even coincident 
with any of these, with obvious dangers! 

Watch out for the “currency” positions, 2/3 and 2/4. 
They also have national variations. In ASCII they are cus- 
: marily # and $, but there are some things to be remem- 
;■ =d: 


■ - is not “number sign” for many countries, most of 
which use “No.” or “Nr.” for that purpose. And when it 
is ''number,” it must precede the digits, not follow. 

•# closely resembled the “sharp sign” in music. 

•# is “pound sign" only for the U.S., the only major coun- 
try still not using the metric system. To the rest, it’s 
kilograms. For now, it’s best to use the abbreviation 
“lb.” in the U.S., not the #. In any case, both must follow 
the numeral. 


■j the British, a “pound” has the symbol which is 
ny that is the symbol in position 2/3 for the UK. They 
j9t very irked when # is called a “pound” sign, espe- 
cially in software manuals. 

•The “dollar” is peculiar to the U.S., Canada, and some 
others. There are also francs, marks, escudos, pesos, 
lire. etc., etc. Which is why the ISO code uses the uni- 
versal currency symbol in position 2/4. It’s a circle with 
outside spikes at 45, 135, 225, and 315 degrees (O), 
called “scarab.” Table II also shows these assignments 
for several countries. 

• ECM A has provided a separate guideline for specifying 
nternational currencies. See the “Where to Get More 
information” at the end of this article. 


It’s a tough problem, and will get worse when we get in- 
to expanded character sets for photocomposition and 
such. For now, all we can do is follow the ASCII stan- 
dard, which says that # is a “number sign." 

Only a few more peculiarities remain for sticks 2-3. An 
important one is in the double quote, position 2/2, and 
the single quote, position 2/7. That is, you may think it is 
a single quote, and even use it so, but it is really an “ac- 
cent acute” for vowels. It slants from top right to bottom 
left, to complement “accent grave” in 6/0, which slants 
from top left to bottom right. Some terminal makers do 
not realize this pairing, and will have accent grave slant- 
ing correctly, but put accent acute as a single quote in 
the unstylized up and down method. My Terminet is one 
of those that is OK. 

Don’t forget that to the typesetter, in contrast to type- 
writers, both single and double quotes have two forms 
— opening and closing. In fact, the typesetter gets his 
double quotes by using two single quotes, of either 
form, because the quote uses very little space in vari- 
able space typesetting. Most terminals, either video or 
hardcopy, use constant spacing. So double and single 
quotes must be distinct for that reason. 

The last variation is in position 2/6, the ampersand. 
There are many legitimate different symbols for the 
ampersand. Neither ASCII nor the ISO Code prescribe 
any particular one. But this leads us to the next topic — 
how to represent the ASCII characters in handprinted 
form, so that they may be input to computer systems. 


HANDPRINTING FOR STICKS 2-7 

The classical confusion for many years was between 
the digit zero and the letter “oh," but there are other 
possibilities for confusion. American Standard X3.45 
specifies the handwritten character shapes shown in 
Figure 3. 


Z23 


n: 






Figure 3. 

This clears up a longstanding problem. The communi- 
cations types, and the armed services, used to put a 
slash through the zero; somehow the IBM users got to 
putting the slash through the letter “oh” instead, con- 
fusing the Scandinavians greatly. Now it’s neither (which 
helps), just a 180-degree rotation of the letter Q. The earlier 
German Standard DIN 66 002 prescribed the cursive loop 
in the upper right, as some may have learned in penman- 
ship courses. It now permits the ANSI form as well. 


UPPER AND LOWER CASE LETTERS 

Many people are accustomed to using upper case only. 
This is a hangover from early line printers and limited 
sets (until the Stretch computer of IBM, characters were 
usually 6 bits in size).- It would have been far better if 
they had all been lower case in those smaller sets. Put- 
ting it simply, would you buy a book to read if it were all 
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in upper case? Because lower case is much easier and 
faster to read, lower case should be the default case 
when one has only the one case. There is no reason why 
FORTRAN or BASIC processors cannot understand 
lower case variable names and verbs just as easily as 
they can understand upper case. 

I always recommend getting a terminal with both cases 
if it is at all affordable. Second best is making sure that a 
single-case terminal is retrofittable later, if necessary. 
And if a single-case terminal, get it in lower case only, if 
possible. There has been much reportage in the com- 
puter trade press about eyestrain resulting from using 
computer terminals. Is the reason obvious? 

STICKS 0, 1 

These are the control characters. The most important 
distinction in ASCII is the split between sticks 0-1, Con- 
trols, and sticks 2-7, Graphics. We'll see this later on in 
the standards for Code Expansion (to 8 bits or more), 
and Code Extension (alternate sets, such as Cyrillic for 
the USSR, and Kata Kana for Japan). 

Unfortunately, there is, despite the standard, much 
difference between the ways that various terminal de- 
vices handle these control characters. They may act dif- 
ferently, or they may not be operative at all. I have two 
very useful programs, written in the TEX language (Ref- 
erence 6). One lists each symbol by name and then 
shows its action between parentheses. The other asks 
you to depress in turn all the funny keys on your termi- 
nal, and then tells you what control character(s) they 
generate, if any. 

GRAPHICS FOR THE CONTROLS 

There are standard graphical representations for the 
32 controls, space, and delete. They are defined by ISO 
2047, American Standard X3.32, and ECMA-17, and are 
shown integral to Figure 1. Some terminals are adver- 
tised as ASCII terminals, and yet generate Greek or 
other characters for these positions. Don’t believe it! 
These symbols are every bit as useful as any Greek char- 
acters could be. 

There are five groups in the basic control set. 

STICKS 0, 1 — Logical Communication Control (10) 

This group is used for both communication and for 
labeling of media. It includes: 

SOH (0/1) (Start of Heading) — used as the first charac- 
ter in the heading of an information message. 
STX (0/2) (Start of Text) — terminates the heading just 
before the text. 


ETX (0/3) 


EOT (0/4) 


ENO (0/5) 


ACK (0/6) 
DLE (1/0) 


NAK (1/5) 


(End of Text) — Last character in the text 
message. Unfortunately, it is generated on 
many terminals via Control-C, and that's just 
to the right of Control-X on the keyboard, 
which is commonly used to cancel a bad in- 
put line. And if you mis-key — ouch! 

(End of Transmission) — the last character in 
any transmission, and usually it turns your 
device off! 

(Enquiry) — requests a response from a re- 
mote station, either an identification of that 
stations (Who are you?) or its status. 
(Acknowledge) — used by a receiver to reply 
“yes” to a sender. 

(Data Link Escape) — an Escape character, 
especially for communications, analogous 
to ESC (1/11). It signals the start of a charac- 
ter sequence that causes a shifting into 
another set of communication controls, 
whenever they are needed. 

(Negative Acknowledge) — used by a receiver 
to reply “no" to a sender. 


SYN (1/6) (Synchronous Idle) — needed by s Vnch 
nous transmission systems to get into n 
stay in, synchronization when no other si'inh 
signal is available to them. n 

ETB (1/7) (End of Transmission Block) — indicates thn 
end of some division of data that the tran«T 
mission system must make, unrelated to anv 
division in the format of the logical data itself 


One lists each symbol by 
name and then shows its action 
between parentheses. The other asks 
you to depress in turn all the funny 
keys on your terminal . . . 

STICKS 0, 1 — Physical Communication (4) 

This group is used for communications. It includes: 
NUL (0/1) (Null) — the standard says that it is “used" 
to accomplish media fill or time fill” . . . “may 
be inserted into or removed from a stream of 
data without affecting the information con- 
tent of that stream.” And that’s exactly what 
the standard also says about DELete (7/15), 
which it lists as a control character even 
though it is not in the control sticks! The only 
difference I can see between them is that on 
perforated tape you can make any character 
into a DELete, but none into a Null. t* 
CAN (1/8) (Cancel) — the receiver is to disregard the 
data received up to that point, starting from 
restart point that receiver and sender have 
agreed upon. It is common in timesharing for 
Cancel (often generated by a Control-X) to 
work on a line-at-a-time basis, to delete an 
unwanted string of entry characters, and ef- 
fectively put one back to the position of re- 
entering the entire line. In this case, the 
agreement between sender and receiver is 
“back to the last CR.” But there are many 
other ways that Cancel could be used, and fo 
parallel as well as serial transmission. 

SUB (1/10) (Substitute) — a character that says prob ; 

ably we would have had another character in 
this position if we could have figured out 
what it was supposed to be! There are many 
reasons for such confusion — perhaps parity 
didn't check out. But it is better to put in a 
SUB to keep the field lengths and such cor- 
rect. Moreover, note its symbol, a mirror im- 
age (not the Spanish inverted) question 
mark. If this is displayable, it will tell you 
definitively that the system doesn’t know 
what it is, and you can make a good guess in 
many cases, particularly in word text. 

EM (1/9) (End of Medium) — defines the previous 
character as the last usable character on 
that medium, whether or not there is more re- 
cordable space on the medium. 

STICKS 0, 1 — Device Control (11) 

This group is used for control of devices such as ter- 
minals. 

HT (0/9) (Horizontal Tabulation) — the standard says 
that is "advances” the active position to the 
next predetermined character position on 
the same line.” There are two ways this can 
work: 


HT (0/9) 


1 00 INTERFACE AGE 


MAY 1978 


by mchro- 
9e ; nto, or 
o other such 

ndh tes the 
at t i trans- 
slateu to any 
al data itself. 



action 
aei isks 
e funny 



)n (4) 

It includes: 
it i; ‘used” 
ill” .“may 
a stream of 
mation con- 
ixac / what 
ELe (7/15), 
irac.„r even 
ks! The only 
m is *hat on 
ly c tracter 
ull. 

sregard the 
tarting from 
sene r have 
esh ng for 
ontrol-X) to 

0 delete an , ; 
:ers ,nd ef- I 
sitii of re- 

5 c; a, the 
receiver is 
e are many 
sed ind for 
ssic 

says prob- 
:haracter in 
figi id out 
re a many 
rhapo parity 
to put in a 
d s ~h cor- 
a m or im- 
I) c 5stion 
/ill tell you 
esn’t know 
od t ess in 
tex 

e previous 
laracter on 

1 is i tre re- 


uch as ter- 

nda says 
itioi. -O the 
osition on 
/s t hi s can 


MAY 1978 





1. Right at the terminal, if it has the hori- 
zontal tab capability built in. Sometimes 
you can set the tab positions by using the 
terminal only; almost always the computer 
can be made to set the tabs on the termi- 
nal. Then when you hit HT during entry, or 
HT is read from the computer output, the 
printing or displaying (active) position will 
skip to the next tab setting. 

2. By a formatting program in the computer, 
which must be given some indication of 
the tab setting positions in force at any 
particular point in the file. The program 
then simulates horizontal tab movement 
by filling the lines with spaces as needed 
to achieve the alignment. 

VT (0/11) (Vertical Tabulation) — the standard says 
that it "advances the active position to tine 
same character position on the next prede- 
termined line." And if you agree with some- 
body else, it can be to the first position in 
that line instead. This is a very dangerous 
character to use. It cannot be used directly 
on any terminal that I know of. Even if it 
could, the implementation rules are not sup- 
plied unambiguously in the ASCII standard. 
And for use by a formatting program, one 
would have to predefine the number of lines 
to be skipped. That’s pretty tough when you 
are inserting and deleting lines, as every pro- 
grammer knows. 

LF (0/10) (Line Feed) — like vertical tab, but just to the 
next line, which is clean enough. If receiver 
and sender agree (again as in vertical tab), it 
can be to the first position of the next line, in 
which case it is called New Line (NL). Some 
manufacturers implement this. I personally 
prefer having a separate Carriage Return and 
Line Feed. Both codes can be generated with 
a single keystroke, and they often are. 

- iO/12) (Form Feed) — again like vertical tab, to the 
same cahracter position unless sender and 
receiver agree that it is to the first position in 
the new line, except that the tab is to a new 
line position that is related to a form of some 
size (those that fold 11 inches apart, for ex- 
ample). This control could run wild if your ter- 
minal or other display device is not equipped 
to handle it, so use it with caution in files. 

CR (0/13) (Carriage Return) — moves the active posi- 
tion to the first position on the same line! 
Not like typewriters. They have effectively in- 
corporated the New Line feature. But the 
■ non-advancing CR is better for terminals, 
even if it is misnamed. Neither video termi- 
nals nor ball and daisy wheel typewriters 
have carriages, so live with it. 

BS(0/8) (Backspace) — Backspace is a very tricky 
character. On some terminals, such as video 
terminals, there is no key to generate Back- 
space for entry into the text stream or buffer. 
On many it can be created via Control-H. Even 
then, it may or may not be operative. 

Backspace is meant for physical move- 
ment of the active position (which may or 
may not coincide with a cursor position, 
when such exists). Historically, it was includ- 
ed for hardcopy terminals and other hard- 
copy devices for some of these uses: 
•Underscoring (underlining). 

•Other forms of highlighting, such as bold. 
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For example, the sequence A BS A BS A 
would strike the A three times on a hard- 
copy device, and make it look boldface 
(such a sequence can also be translated to 
call a boldface font in photocomposition). 
•Editing indications. For example, in legisla- 
tive bill drafting to indicate the deleted or 
changed portion: 

This is obsolete. 

•Forming composite characters, e.g.: 

§ ± :£ f 2 I } F (Hungarian forint) 
•Forming accented letters, primarily for 
European languages. Examples: 

A A ©(Scandinavian letters following Z) 
r*r i *> a • • 

N a a o u 

Warning: Backspace is entirely different 
from a cursor movement on a video terminal ! 
When the cursor is moved to a position 
where a character is already entered, suc- 
ceeding entry in that position usually 
destroys the original character and replaces 
it with the new entry. 

I personally haven't seen any video terminals 
with a true backspace. A former president of 
Infoton told me it could be done as an engi- 
neering special for about $5,000 one-time cost. 

Warning: There are three ways to create 
underscored text for hardcopy terminals: 

1. The characters, that many backspaces, 
and that many underscores (or vice versa). 

2. A character, BS, underscore, the next 
character, etc. This is called the canonical 
form, and is used quite commonly. 

3. Underscore, BS, character, underscore, 
etc. 


I have noticed a lot of difficulty moving back 
and forth between hardcopy (at my home) 
and video (in my office) terminals. One tends 
to underscore on the hardcopy terminal and 
forget that half of the pairs are going to be 
wiped out by the cursor on the video termi- 
nal. In the first two methods above, it’s the 
text that gets wiped out, and it’s hard to read 
on the fly. So if you plan to display a file on a 
video terminal, find another highlighting 
method, or use the third underscoring con- 
vention. Even that may give problems if done 
by embedding an underscoring command in 
the file you pass to a formatting program; 
most such programs put the underscore last 
instead of first. 

BEL (0/7) (Bell) — sounds an audible signal to get the 
user’s attention. Some terminals are not so 
equipped, but they should be. It’s good 
human engineering. But please give me an 
adjustable volume control ! 

And then there are the four device controls for unspeci- 
fied purposes, DC1, DC2, DC3, and DC4 — in positions 
1/1 through 1/4. Different manufacturers treat these like 
a wild card in poker — they make them anything that they 
want. Doesn’t lead to much compatibility, so beware. 


STICKS 0, 1 — Field Separators (4) 

This group is used for formatting and string process- 
ing. These are the separators in positions 1/12 to 1/15. I 
got the idea originally from the Word Mark in the IBM 
1401, which used an extra bit in the low-order character 
in a field as a delimiter. ASCII uses special and separate 
characters to indicate a hierarchical structure. Originally 
I put in eight such characters, but only these four remain: 
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FS(FileSeparator —1/12) 

GS(GroupSeparator —1/13) 

RS (Record Separator — 1/14) 
US(UnitSeparator —1/15) 

FS is most inclusive, US the least inclusive. And we can 
consider the blank/space as the next lower order separa- 
tor from these. Suppose we had a line of text like this: 
(text1)US(text2)US(text3)RS(text4)US(text5)GS(text6) 

On many terminals these delimiting control characters 
would not print, so we would see only a continuous' stream. 
On others they might show as spaces. A TEX command 
to break the line at the record separator would be: 

scan:line:"rs 

The variable ‘left would contain “(textl) . . . (text3)”. The 
variable 'right would contain “(text4) . . . (text6)”. 

STICKS 0,1 — Changing Sets (3) 

This group is used for moving to and from alternate 
graphic and control sets. This includes ESCape (1/11), 
Shift Out (0/14), and Shift In (0/15). 

These basic control characters have permitted design of 
a quite marvelous structure for extension and expansion, it 
allows us to code and classify most of the world’s graphic 
symbols for computer storage, interchange, and display. 
This big area will form most of Part III of this article. 

IN THE NEXT INSTALLMENT 

The ASCII Collating Sequence 
ASCII and Programming Languages 
ASCII and Media 
Keyboards 

ASCII and Display/Printing 
Code Extension — Alternate Controls 
Code Extension — Alternate Graphics 
ASCII and Non-Latin Alphabets 
Code Expansion — 8-bit ASCII 

WHERETO GET MORE INFORMATION 

There are four sets of Information Processing Stan- 
dards that may be of concern to you: 

•ISO. Sold only through ANSI (American National Stan- 
dards Institute), which has the franchise. That makes 
the prices high — much higherthan in other countries. 
•ANSI. These are American National Standards devel- 
oped via the X3 and X4 committees, mostly. Prices still 
pretty high. ■' 

•ECMA (European Computer Manufacturers Associa- 
tion), 114 Rue du Rhone, 1204 Geneva, Switzerland). 
Free, and they have a lot more advanced standards 
than ISO and ANSI. But a modest donation would not 
be unwelcome. 

•Your friendly U.S. Government, in the person of the 
Department of Commerce, National Bureau of Stan- 
dards, Institute for Computer Sciences and Technol- 
ogy, in Gaithersburg, MD 20760. If by any chance you 
are employed by the U.S. Government, you get FIPS 
PUBS (Federal Information Processing Standards Pub- 
lications) for cheap. Otherwise, see ANSI. (Refer to 
Tables la, 1b, and 1c). In many cases they are essen- 
tially reprints of the ANSI standards, for a fraction of 
the cost. 

If you can’t wait for the standards to be approved and 
published, catch them in progress. Ask CBEMA, the 
sponsor of ANSI X3, to put you on an observer list for 
the committee in your area of interest. The address is: 
Robert Brown, Director of Standards 
Computer & Business Equipment Manufacturers 
Association 
1828 L Street NW 
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Washington, D.C. 20036 
(202) 466-2288 Telex 89 29042 
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FOOTNOTES 

' For those curious about the reverse slash, it came from ALGOL 
58. The reference language specified A and V as the symbols 
for AND and OR respectively. I put the reverse slash In so 
these could be made as 2-character groups — and 
2 You will still see many terminals where this vertical bar is 
broken in the middle. This resulted from a hassle with the PL/I 
people, who wanted to stylize the exclamation point (2/1) as a 
vertical bar for OR in that language. And of course that would 
make the graphics the same. The compromise (at horrendous 
cost in people time) was to break the real vertical bar in ASCII, i 
But it turned out that the PL7I people didn’t really need it, or 
else it gained no momentum, so the real vertical bar is back to t 
normal in ASCII-1977. Let’s fix those terminals. 

3 The Italians also have a different solution to hyphenation and 
right justification. It ignores the syllable structure and simply 
demands that if, when you get to the last position in the line, 
the current word is not yet completed, that last character shall 
be underscored, and the word continued without fuss on the 
next line. I rather like it. - 
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The abstract aspects of ASCII were treated in Part I. 
Now we come to some aspects of usage and implementa- 
tion. Certainly one major use area is the ordering of files. 

THE ASCII COLLATING SEQUENCE 

To put items in some ordering, the entire precedence 
relationship for that ordering must be defined. Higher or 
lower, precedes or follows, or whatever. For single char- 
acters, this ordering relationship is called the “collating 
sequence”. 

The ASCII standard used to say that the collating se- 
quence for both graphics and control characters is 
defined simply by their binary representations. Later it 
added a warning that this collating sequence “cannot 
be used in many specific applications that define their 
own sequence”. What an understatement! 

The 1977 version hedges and speaks all around the 
problem without making it clear. It’s not all that difficult. 
Suppose you have two files, and you want to know how 
they differ and/or how they are the same. For this pur- 
pose, the implied collating sequence (straight binary 
comparison) is just fine. The two files will be in the 
same order, and can be matched. 

Whether that straight binary ordering can be used for 
any other purpose is doubtful. It won’t work for signed 
numbers. 

Ordering Numerals 

Take these four values: 22, 13, minus 6, and minus 31. 
If the sign is placed before the digits, ordering by the 
ASCII collating sequence yields: 

+ 13 
+ 22 


because the complete decision is made in the leading 
digit. Again, a worthless sequence. 

The way to achieve a proper ascending sequence is to^ 
separate the values into two groups, ordering those with? 
plus signs in ascending sequence, and those with minus 
signs in descending sequence. Then put the plus group? 
following the minus group. And vice versa for a total de- 
scending sequence. Notice that this works regardless) 
of whether the sign precedes or follows the digits. a 

Ordering Alphabetic Fields 
Alphabetic ordering is even more complex, particu-1 
larly in handling both upper and lower case. Again the 
implied ASCII collating sequence can go wrong. People; 
who have not studied the collating problem for data con; 
taining both upper and lower case are inclined to jump) 
to wrong conclusions. I did myself, for the IBM Stretch: 
computer in 1958, assigning the ascending binary se* 
quence as AaBbCc. Using this for a telephone directory, 
would give us the lefthand column. The straight binary 
sequence of ASCII would yield the righthand column* 
just slightly different: JjSj 

De Carlo De Carlo 9 

De La Rue De La Rue } 

De Long De Long 

DeLair DeLaRue 4 

DeLancey DeLair 

DeLaRue DeLancey 

Delancey Delancey 

de Carlo de Carlo 

de la Rue de la Rue 

deLancey deLancey . 

Either version will get a lot of anguished subscribers! 

In the simplest case, two alphabetic items must be 
compared with the case ignored. Only if they are then;; 
equal is case called into consideration to break the tie J 
and it is also applied successively left-to-right! 

In short, the upper and lower case versions of a letter: 
do not both get full graphic significance. Typing eitherf 
“Y” or “y” will indicate a "yes” reply, but “N” will not.i- 
Because the case distinction Ts minor, comparisons: 
must first be madeon major distinctions, with the minor: 
distinctions used only as tie-breakers. Accenting of let- : 

■ ■■ '.v ■'■-£+ i„- 
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This is obviously worthless. It’s because ordering is 
decided left to right, and the minus sign has a binary 
value 2 higher than the plus sign. Or if the sign were to 
follow the numeric values we would get: 
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ing problem. In the days of punch cards, before com- 
puters, one often used several card files related by a 
key. A sorter (with pockets for the cards to drop into) 
might be used to select the cards for all redheaded 
females between 1 8 and 24 years of age. But these cards 
would have only the employee number and such charac- 
teristics on them. To get the name, address, and tele- 
phone number one might have to go to a second (re- 
lated) deck of cards. So the first deck (the subset of in- 
terest) would be placed in the first hopper of a collator, 
and the deck with all names and phone rtumbers in the 
second hopper. Then a card would be fed from the first 
hopper, followed by successive cards from the second 
hopper, until a match was found on employee number. 
Obviously both decks had to be in the same ordering for 
this to yvork, and thus the term “collating sequence”. 

In effect, we were sticking the cards of the first deck 
upright just in front of the corresponding cards of the 
second. To do this with ASCII requires that we have 
characters that collate lower than the lowest graphic, 
the space (2/1). We do have them. The best to use are 
NUL, FS, GS, RS, and US. Put one of these after each 
search key, then put the two files together and order 
them as adjoined. Now those records having a search 
key' with one of our five control characters appended 
will precede the corresponding record having an ASCII 
graphic following the key. 

Note that the four information separators (FS, GS, RS, 
US) are designed to collate just behind Space, in that 
order. This contiguity means that they can be used as a 
hierarchy of spaces of different class. 

Other Collating Features 

ASCII was designed when there was substantial in- 
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vestment in files already ordered on a Topsy-class IBM 
sequence, where the basic punctuation was low to the 
alphabet, but the digits were high to it. How then to ac 
commodate this and still provide a 4-bit subset? My 
morning shower provided a solution (it still does!). 

The 4-bit subset is formed of the first 10 graphics of 
stick 3 (the digit graphics) and the last 6 of stick 2. This 
jog was shown shaded in the early forms of ASCII, but 
has all but disappeared from memory now. It enables 
stick 3 (with the digits and new special graphics) to be 
ordered high to all the others via passive logic, thus 
overcoming opposition to the adoption of ASCII. 

ASCII AND PROGRAMMING LANGUAGES 

Standard ECMA-53 (1978 Jan), “Representation of 
Source Programs for Program Interchange,” gives the. 
subsets and/or modifications of ASCII as they are used 
for these five programming languages (Footnote 1): | 

NO. OF CHARACTERS USABLE 
Language Subset of ASCII Other 

APL 57 32 

Minimal BASIC 60 0 

COBOL 51 0 

FORTRAN 49 0 

PL/I 55 2 

Figures 1 through 5 are the character sets for these Ian-' 
guages as given in ECMA-53. They show the only charac- 
ters permissible for use in source programs, except for: 
non-numeric literals in COBOL 

comment-entries • ” :'f 

comment lines ” 5 

character constants in FORTRAN 

comments ' ” 
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BBBDB BBBBBBBB 

BDBBB BBBBBBBB 

DDDDB BBBBBBBB 


Figure -L . 

character-string-constants in PLi 1 

comments 

For these purposes only, other ASCII characters may be 
used, providing there is agreement between the sender 
and receiver for any interchange of source programs. 

The TEX language has gone farther than this general 
caution. There the specific characters have permanent 
names. For example, one could say: 

linefeed = “ 

i* (actual line feed inside. the quotes) 
if lf:eqs:linefeed .... 

and it would be true, because “*lf" is the permanent name 
of Line Feed. The control characters have names that are 
the letters from the ASCII chart, preceded by the aster- 
isk to show that they are read-only variables with perma- 
nent content. TEX can in fact operate upon all 256 char- 
acters-of ASCII in an 8-bit byte, all 512 in a 9-bit byte. 
Specific Notes on the Figures 
APL -Sticks 6 and 7 (ordinarily lower case alphabet) 
are replaced entirely except for the DELete 
position. 

— . -Space is nonprinting, although the symbol 
shown is SP. 

-Ampersand (216) is not used for writing source 
programs, except as the last character of a 
line if that line is to be continued on the next 
line. 

PUI -In position 2/1, the exclamation point is re- 
placed by a vertical bar for OR. 

-In position 5114, the circumflex is replaced 
by the symbol shown, for NOT. 

-If you have to use your terminal for both PLII 
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Figure 5. 

and some other programming language, 
forget that foolishness. You can get by with 
the exclamation point as OR, and the circum- 
flex as NOT. The important point in source 
program interchange is to have the encoded 
representations of the characters exchanged 
correctly. ^ • ‘ 

(all) -Although the character BLANK (space) is 
shown as the flagged lower case “b" in the 
FORTRAN and PL/I sets, there is no printing 
graphic to indicate it. For all practical pur- 
poses, it is really the Space of ASCII (210). 

-Four of these five languages (not APL) have 
the shown in 2/4. When the International 
Reference Version of the code is used, this 
l becomes the universal currency symbol, 

which is also acceptable. 

-Minimal BASIC uses "ft”, which is the Inter- 
national Reference Version symbol. The na- 
tional symbols, such as the English pound 
sign, are also acceptable. 

ASCII AND MEDIA 

ASCII and Punch Cards 

Reading and punching equipment for punch cards, be- 
ing very mechanical, is so expensive that microcomputer 
people are unlikely to use them. So you might ask why 
we bother here with the representation of ASCII on this 
medium? I can think of at least three reasons: 

•A scientist at the U.S. National Bureau of Standards 
said once that if punch cards were on the way out, it 
was the only product he ever saw dying on an upward 
usage curve. Thus they are likely to be around for a long 
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Hollerith Punched 
Card Code 

Track Assignment - 
25.4 mm Perf. Tape 

Track Assignment - 

12.7 mm Mag Tape 
200 cpi NRZI 9-track 

Track Assignment - 

12.7 mm Mag T ape 
800 cpi NRZI 9-track 

Track Assignment - 

12.7 mm Mag T ape 
1600 cpi PE 9-track 

Track Assignment - 

12.7 mm Mag T ape 
6250 cpi GCR 9-track 

Labe li ng 8 File 
Structure - 12.7 mm MT 

Track Assignment - 
Magtape Cassette 
3.81 mm, 32 bpmm 


Labeling 8 File Struct. - D I S 41 
3.81 Magtape Cassette 4341 


ISO 

ECMA 

ANSI 

FIPS 

PUB 

CSA 

BS 

AS 

1679 

2021 

44 

X3. 26-1 970 
$4.25 

14 

Z243.14 

.36 

4636/3 
/ 4 

1063 

1113 

10 

X3 .6-1965 
$3.00 

2 

Z243.8 

3880/3 

1062 

1862 

5 

X3. 14-1973 
$3.25 



3968 

1008 

962 

1863 

12 

X3. 22-1973 
$3.75 

3-1 


4503/1 

1009 

3788 

36 

X3. 39-1973 
$3.75 

25 


4503/2 


DP 

5652 


X3. 54-1976 
$5.25 

50 




1001 

13 

X3. 27-1977 
(unpriced) 


Z243.7 

4732 

1068 

3275 

3407 

34 

X3. 48-1977 
$5.75 

51 


5079/1 



AS CCITT J IS 


Track Assignment - DIS 

6.35 mm Cartridge Tape 4057 
64 bpmm PE 


DIS 46 X3. 56-1977 

057 $4.24 


Table la. Standards for ASCII on Physical Media. 



ISO 

ECMA 

ANSI 

FIPS CSA BS 

AS CCITT J IS 

GOST [ 





PUB 



Bit Sequencing in 



X3 .1 5-1 976 

16-1 

V .4 


Serial Transmission 



$3.00 


X .4 


Char. Structure 8 Parity 



X3. 16-1 976 

17-1 

V .4 


Sense - Seria l-by-Bi t 



$3.50 


X .4 


Char. Structure 8 Parity 



X3. 25-1976 

18-1 

V .4 


Sense - Paral le l-by-Bi t 



$3.50 


X .4 


Procedures for Using 

1745 

16 

X3. 28-1976 

Z243.13 4505/1 

1484/1 


Commun. Control Chars. 



$10.50 




Message Heading 

1745 


X3. 57-1977 




Formats 



$5.25 




Advanced Data Commun. 



BSR 




Control Procedures 



X3.66 





Table 1b. 

Standards for ASCII in Communications. 
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time, and you may need to transfer some of those files 
to other media that you do use. 

•There is some likelihood that microcomputers couid be 
used in the reading and punching equipment itself, to 
make it less expensive. 

•ASCII users are going to be confronted for a while yet 
with one of the several versions of IBM’s EBCDIC, and 
the punch card assignments provide the only legitimate 
link for conversion of EBCDIC files to ASCII. 

So Figure 6 defines the hole patterns for the binary en- 
codings. And Figure 7 defines the encodings for the 
hole patterns. Don’t worry about the inconsistency in 
the relationships. Nothing can be done about it now, 
because it started with Herman Hollerith’s first U.S. 
Census machines in 1890. At first only digits and + and 
- signs were used. Then the code was expanded to the 
upper case alphabet. And other special characters for 
commercial use. When FORTRAN came along in 1964, it 
turned out that the limited capability of the subset of a 
6-bit set would not permit the graphics needed for scien- 
tific work. For a long while there were dual graphic rep- 
resentations for several of the punch card code combina- 
tions, and this carried over into printer chains, and so on. 

The only logic that the patterns follow is that they do 
or do not have a punch from among these six possibilities: 
12-punch (top row) 

11-punch (next to the top row) 

0-punch 

8- punch 

9- punch (bottom row) 

a punch from among the digits 1 through 7 
Including the no-punch-at-all combination (NUL), this 
gives 256 combinations, just right for the 8-bit code. 
Although ASCII was technically only a 7-bit code at the 
time this rule was formulated, it was felt necessary to 
plan ahead a little. 

ASCII and Magnetic Tape 

Figure 8 gives a compact representation of several 
relationships, among which is the assignment of ASCII 
bit pattern to 9-track magnetic tape. The jumbled assign- 
ment may remind you of the “firing order" for the cylin- 
ders of an automobile engine. In fact, we used to call it 
just that. It was intentional for increased reliability. As 
in so many cases, better technology has removed the 
need for peculiar design, but the assignments are un- 
changeable because of data file investment. 

There is no parallelism in recording and reading on 
cassettes and cartridges. The ASCII bits are recorded 
serially in the track. Thus Figure 8 does not consider 
these media. 



ASCit and Communications 
Not only is the topic of ASCII and communications a 
very complex and large dissertation for this article — it 
is also undergoing substantial rethinking, enlargement, 
and invention. You will have to follow on your own the 
workings of the CCITT, the various networking systems 
of the several large and many small manufacturers of 
computer systems, and the offerings of the common car- 
riers — either on the local distribution system (via ATT) 
or direct distribution (via Satellite Business Systems). 
Many of the existing standards are listed in Table 1b. 


Many more are under development. Arguments are rag- ^ 
ing internationally on the merits'of packet switching, *jt 
byte protocols, value-added systems, open-working sys- } 
terns, tariffs, data movement across national borders, 1 
the X.25 protocol, etc., etc. ATT is offering a new service 
because they suddenly discovered data-undervoice 
(DUV). All I can tell you now is that it is all based upon ^ 
ASCII, and the proposed protocols are all dependent 
upon the ASCII control characters in stick 0 and 1. It will 
take years for this to shake out, and for now all one can 
do is get on the CBEMA mailing list (see reference, Part 
I, INTERFACE AGE, May, 1978). 

ASCII AND THE METRIC SYSTEM 

The full ASCII graphic set (both cases) is sufficient to '■ 
indicate all symbols and prefixes of the SI (International 
System of Units, the new metric system), with three ex- '' 
ceptions. They are the Greek letters “omega” for “ohm”, ? 
and “mu” for “micro”, and the degree symbol for Cel- ! 
sius temperature. These three characters will be provid- ; 
ed in 8-bit ASCII (see Part III, next month). Meanwhile, ; 
for these, and also for such equipment that has only a 
single case, there is a standard way of representing the 
SI units and prefixes. This is given in International Stan- 
dard 2955, “Representations of SI Units and Other Units : 
for Use in Systems with Limited Character Sets”, and \ 
also in American Standard X3.50-1976. 

To keep the record straight, let’s first look at the char-'-J| 
acters used for the prefixes. They’re shown in Table 2, J 
which indicates multiples from 10 to the - 18 up to 10 to ' : -4 
the + 18: 


10 + i 

exa (E) 
peta (P) 
tera (T) 
giga (G) 
mega (M) 

kilo (k) 
hecto (h) 
deka (da) 


10 - i 

atto (a) 
femto (f) 
pico (p) 
nano (n) 
micro (n) 

milli (m) 
centi (c) 
deci (d) 


Table 2. Metric Prefixes 


Above 3 there are no powers except multiples of 3. This 
practice breeds better comprehension, like marking off 
three’s in writing numbers of many digits. Also, as a 
memory convenience, all symbols are upper case for 
powers greater than + 3. And there are no conflicts with 
the symbols for the units of measurement. 

Now, again for the record, here are the ASCII charac- 
ters) used as symbols for the units: 


A 

ampere 

cd 

candela 

Bq 

becquerel 

d 

day 

C 

coulomb 

g 

gram 

°C 

degree Celsius 

h 

hour 

F 

farad 

1 

litre 

Gy 

gray 

Im 

lumen 

H 

henry 

lx 

lux 

J 

joule 

M 

micro 

K 

kelvin 

m 

metre 

N 

newton 

min 

minute (time) 

S2 

ohm 

mol 

mole i - 

Pa 

pascal 

rad 

radian 

S 

siemens 

s 

second (time) 

T 

tesla 

sr 

steradian 

V 

volt 

t 

tonne/metric ton / 

w 

watt 


megagram 

Wb 

weber 




Table 3. 

Metric Units 


Table 3 shows the rules clearly. Units not named after 
people are all lower case, as-shown in the righthand col- 
umn (although I do know a Mr. Day). In the lefthand col- 
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umn are the units that are named after people. The 
names of the units are not capitalized at all, but the sym- 
bols begin with an upper case letter. 

I said previously that there were no conflicts between 
unit and prefix symbols. But you’ve probably noticed “d” 
for both “day” and “deci", “h” for both “hour” and “hec- 
to”, “m” for both “metre” and “milli”, and “T” for both 
“tesla” and “tera”. OK. But there isn’t any confusion in 
actual usage, because the prefix precedes the unit: 

dd is a deciday (2.4 hours) 
hh is a hectohour (100 hours) 
hH is a hectohenry (but don’t ever use the term) 
mm is a millimetre 

Mm is a megametre (1/300 the speed of light) 

TT is a teratesla (Wow!) 

I am not suggesting that the prefixes should be ap- 
plied to other than the primary metric units (the second 
is the primary time unit; hour and day are not), even 
though the timesharing system I customarily use 
figures my time in millihours. But when you get accus- 
tomed, the prefixes are very valuable in other ways. For 
example, an American b'illion is a kilomillion, whereas 
the British billion is a megamillion! And my metric teach- 
ing program understands such things as kilofathoms. 

The “space” character is also vital to correct SI usage. 
It must occur between values and units, like 123.6 mm, 
and 22 °C. 

And don't forget another peculiarity of ASCII as an in- 
ternational alphabet: (1/14) is absolutely not defined as a 
“decimal point” (nor is it defined as “period”, which in 
the United Kingdom is “full stop"). For most of the rest 
of the world, the comma (1/12) is the decimal marker, and 
the period is used to mark off threes. That’s why the recom- 
mended practice for marking off threes is to use the 
space, not either comma or period. E.g., “1 234 567 mm". 

To save you the bother of looking up the standards for 
use with limited character sets, here is the algorithm: 

1. If you have ASCII with both cases of alphabet, the 
three missing symbols are handled as: 

ohm for 52 

Cel (initial cap) for °C 

u (lower case) for \i (micro) 

2. if you have only one case of alphabet (either upper 

or lower), use it, and these three replacements 
remain as: 



I# 

2 3 


And in addition: 


S (siemens) 

SIE 


sie 

h (hour) 

become HR 

or 

hr 

t (tonne) 
Examples: 

TNE 


tne 


16 UOHM is 16 nS 2 

373.15 K = 100 Cel 

Notice that no plurals are used in symbol combinations 
— MICROOHMS, but UOHM. 

ASCII AND KEYBOARDS 

Technically, a keyboard is an ASCII keyboard if it gen- 
erates the proper codes for the full set of ASCII graphic 
and control characters. Moreover, none of the graphic 
characters should have any control properties. 

There are many types of special keyboards — Dvorak, 
a two-sided one used like an accordion with the hands in 
a vertical plane, Touch-Tone and its derivatives, etc. 
There are no formal standards to relate these keyboards 
to ASCII. For typewriter-style keyboards, however, there 
are two versions given in the American National Stan- 
dard. One is derived from the usual electric typewriter 
keyboard, the other is called the “bit-paired” keyboard. 
Only the bit-paired keyboard will be shown and discussed 
here, because the other form is the subject of proposals 
for extensive change due to the growth of Word Pro- 
cessing. ANSI Committee X4A12 is studying this now. 

The bit-paired keyboard was designed for minimum 
circuitry cost. Thus the "at” symbol (4/0) is paired with 
the accent grave (6/0), “A” (4/1) with ”a” (6/1), and “ + ” 
(2/11) with (3/11). Thus the shift key affects each 
other key by only a 1-bit change. 

This keyboard is shown in Figure 9. It is the inter- 
change keyboard of ECMA-33. The numbered arrows key 
to the notes on changes that would make this ECMA 
keyboard into the ANSI keyboard for ASCII. It is also 
equivalent to the keyboard of ISO Standard 2530-1975 
(Footnote 2). 

Notes for Figure 9 

1. For and ANSI keyboard, this key is put to the 
right of the circumflex key, on the top row (see 
Note 6). The Shift Key is put in its place. 

2. If this key exists and is available, the ECMA and 
ISO standards put the underscore here, remov- 
ing it from the “zero" key. 

3. The ANSI keyboard of course puts a “$" here in 
place of the international currency symbol. 

4. This is where the underscore is removed for the 
48-key keyboard (see Note 2). 
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5. Here ECMA and ISO show the “ overline ” instead 
of the “tilde." It's a question of styling. 

6. The ANSI keyboard has the reverse slash and 
vertical bar here, rather than between the shift 
key and “Z" (see Note 1). 

7. The ANSI keyboard specifies the underscore 
here, in both shifts, rather than the positions 
shown as options in Notes 2 and 4. Practically 
no keyboards follow this. In fact, as I am enter- 
ing this text, this is the only key where my Info- 
ton Vistar deviates from the ANSI standard. It 
has Line Feed there, with Return to its right — a 
very sensible arrangement. 

Customarily, the Control Key is also tied to bit-pairing in 
such keyboards. The standards recommend that charac- 
ters created in combination with the Control Key should 
use the graphic key in sticks 4 or 6 units higher. Thus 
“X” (5/8) or “x” (7/8) in combination with the Control Key 
produce CAN (1/8). Unfortunately this also means that 
Control-C generates ETX (0/3). And whereas Control-X as 
CAN is used frequently, to erase an input line of text, 
ETX is not often wanted. Yet it is a common miskeying 
to hit C rather than X. In many timesharing systems you 
will get a disconnect rather than a line delete. 

Ccnlrci and Function Keys 

The so-called “QWERTY” arrangement is prevalent 
throughout the world. Even the French “AZERTY” set is 
being considered for change. But on top of these basics 
there are hundreds of keyboard varieties. Some of them 
have “dead keys” (i.e., the platen or printing element is 
not advanced when they are hit). This avoids having to 
use BS for accented letters, but it also creates difficul- 
ties in code generation. 

There are some general good practices that ASCII key- 
boards should follow. To facilitate usage by those exper- 
ienced with typewriters, all controls not used with type- 
writers should be located outside the customary touch- 
typing area. As a specific example, the Break/Interrupt 
key should be located where it is a definite effort to 
reach it (not mixed in with the keyboard). ISO 3244 may 
be consulted for these considerations. 

Function Keys are those that generate sequences of 
more than one ASCII character. Examples are cursor 
keys, Erase-to-EOL, etc. They should be located in spe- 
cial clusters. Most importantly, they must all generate 
ASCII codes for transmission when in character-at-a-time 
mode. I know of video terminals where the cursors do 
not generate codes, as they should not while in full page 
buffered mode; but they still operate in line mode with- 
out generating codes. In this case the screen is alter- 
able, but there is no way of detecting it in the computer. 

Many keyboards will have some function keys that are 
unlabeled, for do-it-yourself assignment. These should 
also be clustered separately, and generate code se- 
quences when in line mode. 

ASCI! AND DISPLAY PRINTING 

When ASCII characters are displayed, it may be on a 
video screen, paper, or COM (microfiche). 

On the video screen there are a number of methods to 
form the characters, mostly at the manufacturer’s pref- 
erence. They are usually at pica (constant-width) spacing 
for economy, so an approximation of graphic quality 
(such as typesetting) is not obtainable. When lower case 
is available, the risers and tails extend above and below 
the line for some screens. In others, they fall within the 
boundary lines of the upper case characters. They may 
be shown in inverse video (light background block), or 
highlighted by different brightness or blinking. Controls 
for this work will be taken up in Part III of this article. 

For paper copy one usually finds either direct impact 
of a formed letter, or stylus printing. Either method is 
suitable to proportional spacing if desired. Recently 
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there has been a general trend toward using the 7x9 dot 
matrix shapes of ECMA Standard 42 for stylus printers 
This set of graphics is shown in Figure 10. 

For hard print elements, of course, one can get a near 
ly infinite variety of styles and fonts. There are only two 
however, specifically associated with computers — 
OCR-A and OCR-B. “OCR" stands for “Optical Charac 
ter Recognition”, meaning that the shapes are so styled 
that a computer-controlled scanner can read the charac- 
ters as printed on paper, and encode them directly from 
their shapes. 

OCR-A is not suitable for human reading. It’s the fun 
ny looking one with the diamond-shaped letter “Oh.” I 
won’t dignify it by showing the font here. It was thought 
formerly, with technology of that day, that making 
humans work harder to read letters would make it easier 
and thus cheaper for computers to read them. This argu 
ment turned out to be specious, and with today’s tech 
nology there is no need to use anything other than OCR-B 

OCR-B is specified in ISO 1073/2, ECMA-30, and ANSI 
X3.49. It is the font shown in Figure 11.1 have it on my 
IBM golfball typewriter at home, and on my daisywheel 
element at the office. So it should be available for most 
hard elements, including the carousel type. 

The first six rows correspond to ASCII sticks 2-7. In 
the first row, the pound and universal currency symbol 
are for replacement as needed. In the fourth row, the 
underline is discontinuous; a continuous form is shown 
in the auxiliary set. This set also contains a matching ac 
cent acute instead of single quote, the real circumflex 
(not an up arrowhead), a cedilla, and an “m” of better 
proportion. □ 
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Footnotes 

1. With the distribution, ECMA said “ECMA-53 is an attempt to 
improve portability of programs. It links the language char 
acter sets defined by the language standards, their coded 
representatives by means of the 7-bit code and the imple- 
mentations on data carriers (punched tape, punched cards, 
magnetic tape and magnetic tape cassettes and cartridges) 
It is a standard of a new type in which already standardized 
features are assembled in a new standardized combination 
aimed at supporting interchange and decreasing implemen 
tation dependency.” 

2. ISO 2530 is for the alphanumeric area of the keyboard only 
It is augmented by ISO 3243-1975 — Keyboards for Coun 
tries whose Languages have Alphabetic Extenders, Guide- 
lines for Harmonizations, and also by ISO 3244-1974 — Prin 
ciples Governing the Positioning of Control Keys on Key 
boards. 

The fact that these are "guidelines” and “principles” indi- 
cate the complexity of the subject. Typewriter manufac- 
turers now supply over a hundred different keyboard arrange- 
ments, as their catalog will indicate. 
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CODE EXTENSION — GENERAL PRINCIPLES 

Ov e r ten years ago it was recognized that ASCII was 
the basis for codification of the various symbols used 
throughout the world. Through it, libraries could store 
encoded books as well as printed books. And while elec- 
tronic mail may be quite simple with ASCII and its 
R °' h 1 ®" alphabet ’ that ’s not the alphabet of all countries. 
, he . US ® R uses Cyrillic, the Japanese use Katakana, and 
the Arab world uses its own semi-script alphabet. More- 
over. to send a mathematics textbook by electronic mail 
one would have to be able to encode the formulas and 
special symbols peculiar to mathematics, which in- 
eludes many Greek characters! 

This is where the ESCape character and ESCape se- 
quences come in. You can get the whole complicated 
story from ISO Standard 2022 (or ECMA-35) on Code Ex- 
tension Procedures. But it will be easier to think of 
reproducing many ASCII Code Tables on the pages of a 
book, then replacing the ASCII symbols on all but the 
first page with the other alphabets we need. 

Then we make sure that everyone in the world has the 
same (code) book. (The resemblance to military code 
books is intentional.) That’s done by registering the 
page number assignment to characters (actually either a 
control set or a graphic set, but not both) with the 
French Standards Body AFNOR. That's the Association 
Francaise de Normalisation, Tour Europe Cedex 7 
92080 Paris La Defense, FRANCE. But you’ll find it 
perhaps easier to get it from ANSI (see data in the first 
installment, INTERFACE AGE, May 1978). 

The registration procedure is spelled out in ISO Stan- 
dard 2375. It is carefully controlled to prevent frivolity 
and cluttering up the assignment books, for that all 
costs money. But the important control and graphic sets 
of the world may be registered and assigned their own 
unique ESCape sequence for calling or invoking them. 

CODE EXTENSION — BASIC RULES 

The control ESC, when encountered in a datastream, 
means that all characters following it, up to and including 
the first character from sticks 3 to 7, have special interpre- 
tation. The delimiting character is called a “final” (F). 
Those between ESC and the final are called “intermedi- 
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ates” (I). All of the codes in stick 2 can serve as interme- 
diate characters in ESCape sequences of 3 or more char- 
ln len 9 th. The entire group of characters from 
ESC through the final is called an ESCape sequence. 

ESCape sequences obviously require buffers for inter- 
pretation, for we cannot know, when they begin, how 
long they will be. Sequences of length 2 are for sinqle 
controls. If the character following ESC is from stick 3 
the sequences are for private usage, of the class Fp. If it 
is from sticks 4 or 5, they mean single controls, of the 
class Fe, from an appropriate set of 32. If from sticks 6 
or 7 (except 7/15), they are of the class Fs, composed of 
single controls. This is elementary extension. 

A more complex type of extension is the simulation of 
one or more 8-bit character sets by alternating between 
two 7-bit sets. The home base set consists of the CO (32 
controls) set and the GO (94 graphics plus space and DEL) 
The alternate sets consist of the Cl (32 controls) set and 
and the G1 (94 graphics plus space and DEL). The 8-bit 
set (it doesn’t have to be just theoretical if you have a full 
8-bit capability) consists of the fourparts C0-G0-C1-G1. 

These four types of sets are all invoked (designated) 
by 3-character ESCape sequences in this manner, where 
F is the final (3rd) character: 

Sequence Invokes Set Type 

ESC 2/1 F C Q 

ESC 2/2 F Cl 

ESC 2/8 (or 2/12) F GO 

£SC 2/9 (or 2/15) F 61 

The final character “F” selects the particular set to in- 
voke. Once invoked, encountering or entering an SO 
shifts to the G1 set in force; an SI shifts to the GO set in 
force. SO and SI do not affect the control set. 

ISO Standard 2022 defines these matters in far more de- 
tail, but that is enough for here. That document is compli- 
cated and ingenious, and deserves substantial study. 

THE CODE EXTENSION REGISTRY 

Table 1 identifies the graphic sets registered to date. 
Table 2 identifies the control sets registered to date. Re- 
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Regi s. 

Final 


No. 

Char. 

Name 

002 

4/0 

IRV (Inti. Reference Version) Graphics 

004 

4/1 

UK Graphics 

006 

4/2 

US Graphics (ASCII) 

008-1 

4/3 

NATS Main Graohic Set (Finland / Sweden) 

008-2 

4/4 

NATS Additional Set (Finland, Sweden) 

009-1 

4/5 

NATS Main Graphic Set (Denmark, Norway) 

009-2 

4/6 

NATS Additional Set (Denmark, Norway) 

010 

4/7 

Swedish Basic Graphics 

Oil 

4/8 

Swedish Graphics for Names 

013 

4/9 

J IS Katakana Graphics 

01 4 

4/10 

JIS Roman Graphics 

015 

5/9 

Italian Graphics 

017 

5/10 

Spanish Graphics 

018 

5/11 

Greek Graphics 

019 

5/12 

Latin-Greek Graphics 

021 

4/11 

German Graphics 

025 

5/2 

French Graphics 

027 

5*5 

Latin-Greek Mixed Graphics (Greek Capitals only) 

031 

5/8 

Greek Alphabet Set for Bibliographic Use 



For a GO set the ESCape sequence is 



ESC 2/3 plus the final shown. 



For a G1 set the ESCape sequence is 



ESC 2/9 plus the final shown. 


Table 1 

Registered Graphic Character Sets 



Regi s. 

Final 


No. 

Char. Name 


001 

4/0 ISO 646 Controls 


007 

4/1 Scandinavian Newspaper Controls 


026 

4/3 IPTC Controls 


The SSCape sequence for a CO set is 
ESC 2/1 plus the final shown. 


Table Vb. Registered Control Character Sets 

Table 2. Registered Control Character Sets 


The registry set is available from AFNOR for approxi- 
mately 172 French francs, say $35. It would be vital for 
an equipment or software manufacturer to have it, and it 
comes in a beautiful 4-ring binder symbolizing world- 
wide interchange compatibility. But the summary pro- 
vided here will fill most needs. 


. . .the work I had to do to 
compact the standard, trying 
to make it understandable, turned 
up more than unreadability. So 
it’s back to the drawing board, 
perhaps for a considerable period 
of time. . .it’s sometimes useful 
to have symbols whose meaning 
you can reassign without harm 
to programming languages. . . 


CONTENT OF THE EXTENDED SETS 

Figure la shows, against the ISO Code, International 
Reference Version, how the other graphic sets differ in the 
column/row positions shown. The rows are keyed to Table 
1, reminding you that ASCII is “006", or “ISO 646-006”. 

From this figure we can see that many countries need 
accented letters as individual characters, not compound 
via BS (Backspace). This is particularly true for the double 
sets 008 and 009, for Scandinavian newspaper trans- 
mission, which have characters that cannot be made 
from ASCII in compound form. For example — Ring-A, a 
solid, and the angle open and closed quotes. 
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Figure la doesn’t show Set 031 because it deviates 
more and is not of that much general interest. It doesn't 
show the Japanese Katakana set because that is com- 
pletely different from the IRV. In fact, Japanese Indus- 
trial Standard C6220-1969 is an 8-bit coded set with the 
IRV (see Set 014 for the dollar and yen signs) in the lower 
(bit 8 = 0) portion, and Set 013 in the higher portion, 
with space reserved for future additional controls. This 
Set 013 is shown in Figure 2. It is shown in its high-order 
position, to indicate the card codes at the same time. 

Figure 2 also shows the Cyrillic set of the USSR state 
standard GOST 13052-67, but it is not half of an 8-bit set 
as the Japanese do it. Rather it is another page of exten- 
sions. After SO (Shift Out) is used, the Russian register 
is operative. Following SI (Shift In) it is the IRV. Although 
this set has no registry number now, it was submitted 
recently by ECMA, and we expect an assignment soon. 
3y the way, both Katakana and Cyrillic are shown in 
:heir OCR font. 


Figure 3 shows the contents of the registered control 
sets. Set 007 serves as control set for the graphic sets 
008-1,2 and 009-1,2, for Scandinavian newspaper trans- 
mission. And set 026 is the control set for the worldwide 
newspaper transmission, defined by the IPTC (Interna- 
tional Press Telecommunications Council). The 18 con- 
trol positions not shown, and those where there is no 
entry, are the same as in the International Reference 
Version (646-001). 

These newspapers are driving composition equip- 
ment, not line printers, so they don’t need VT and FF. 
Their set is already defined, so they don't need SO and 
SI. They have (properly) assigned meaning to three 
device controls. And they’re probably not doing payroll, 
so they don’t need the four information separators. But 
they do transmit, and instead of choosing their own 
functions and placement they have chosen to be a 
registered variant of the ISO Code. And all variants 
within this controlled and registered cluster can at least 
recognize each other, even if they can’t print it! 
















































































































IRV 





Position 

001 

007 

026 



0/09 

HT 

F0 

F0 

Format Control 


0/11 

VT 

ECD 

ECD 

End (a typographical) CommanD 1 

0/12 

FF 

SCD 

SCD 

Start ( " 


0/13 

CR 

QL 

QL 

Quad Left 


0/14 

SO 

UR 


Upper Rai l 


0/15 

SI 

LR 


Lower Rail 


1/01 

0C1 


Font 

1 Change to normal 


1/02 

DC2 


Font 

2 Change to italic 


1/03 

DC3 


Font 

3 Change to bold 


1/08 

CAN 

KW 

KU 

Kill Word (through 

previous space) 



ss 

SS 

SuperShift 


1/13 

GS 

QC 

QC 

Quad Center 


1/14 

RS 

QR 

QR 

Quad Right 


1/15 

US 

JY 

JY 

JustifY 


Figure 3. Registered Control Character Substitution 


CODE EXTENSION IN ACTION 


To illustrate the operation of code extension, let’s im- 
agine some equipment that may not exist now: 

•A microfiche reader with automatic location controls. 
•A microfiche with ASCII (the 8-bit form) on the first two 
pages, the other pages containing other sets such as 
Katakana, Cyrillic, Arabic, Greek, Hebrew, mathemati- 
cal symbols, astronomical symbols, etc. Also, symbol 
sets for selecting typestyles, weights, rotations, sizes 
and elongations. 

A display screen for the microfiche; it is touch-sensi- 
tive and generates 7-bit codes according to location 
touched on the display. 


nately, the work I had to do to compact the standard, try- 
ing to make it understandable, turned up more than 
unreadability. It turned up many logical flaws and ambi- 
guities. So it s back to the drawing board, perhaps for a 
considerable period of time. 

Figures 4a through 4e will give, however, some flavor 
of the controls under consideration. 

Figure 4 shows the controls of Format Type (FT) 1 and 2 
Format 1 is either the single character of the 8-bit set 
shown in the first column as “Ce”, or the 2-character se- 
quence of the type “ESC Fe", where Fe is a final charac- 
er taken from 4/00 to 5/15, and whose column desiqna- 

lon .'V, less than Ce - l e - in an 8-bit code, INDex would 
be 8/04. In a 7-bit code it would be ESC 4/04. Format 2 is 

* Im . type * ESC Fs ”' wh ere Fs is a final taken from 6/00 
10 7/14. 

Figures 4b through 4e show controls with formats be- 
ginning with the control “CSI”, defined in Figure 4a to 
be either 9/11 (in the 8-bit set) or “ESC [” (in the 7-bit 
sets). The six possible formats are: 


3a = CSI Pn F 
3b = CSI Pn ; Pn F 
3c = CSI Ps ? 


4a = CSI Pn I F 
4b = CSI Pn ; Pn I F 
4c = CSI Ps I F 


•As an alternative, keyboard tops with fibre optic bundles 
molded in as a matrix, so that the keytops can be lighted 
with different symbols as selected. 

Now imagine that we are writing an astrology book: 
•Type 


Those of you born under the sign of Aries C 

•Depress the "astro" key on the special keyboard 
•Notice the shift in display for the fiche screen and/or 
the keytop lighting 

•Touch the Aries symbol on the screen (or the keytop) 
•Depress SI (Shift In) on the special keyboard 
•And return to typing the rest of the sentence 

) will find this month ... 

Now imagine what a computer would do to the input 
stream in driving photocomposition equipment. The 
astro [ley generated an ESCape sequence for an astro- 
nomical graphic symbol set that would have been regis- 
tered by AFNOR. When the input parser recognizes ESC, 
it analyzes the following characters, and then calls this 
set of character formation methods from the backup 
store, generates the character shape for Aries according 
to the character code after the final character, notices 
SI, and returns to normal mode. 

Now we can envision how all of the world’s printed 
material can be stored in machine-readable form, and in- 
terchanged recognizably! 

ALTERNATE CONTROLS 

Work has been in progress for several years to develop 
a companion standard for controls for devices such as 
CRT terminals. In the US this is contained in the ANSI 
document BSR X3.64, Additional Controls for Character 
Imaging. In a similar form, this Cl set is before the 
Codes Committee of ISO Technical Committee 97 (Com- 
puters and Information Processing) as document 2 N 
868, for consideration at its 1978 May 24-26 meeting. 

I had hoped to give the essence of this work in this in- 
stallment. There were only two negative votes in X3, 
which one could presume might be answered. Unfortu- 


Pn stands for numeric parameter(s), Ps for a variable num- 
ber of selective parameters separated by semicolons. 
The type 4 formats differ from type 3 only in inserting 
the intermediate character 2/00 just prior to the final. 

In the figures, the parameter value enclosed in paren- 
theses , s the default value. That is, if the parameters are 
not actually inserted, i.e., being null, then the effect is 
the same as if the default value(s) were inserted 
To give an example of how these controls operate 
look in Figure 4d for the second mnemonic, SGR (Select 
Graphic Rendition). It is represented first by CSI the 

?° n , tr c °, l ,o S tT uence lntroducer ’ ‘he parameter, and the 
final 6/13. This means that when the 4-character string 

ESC [ 6 m 

is encountered, it should turn on rapid blink in the field(s) 
specified on your video screen. 


AL = Active Line (containing AP) 

AP = Active Position (where the cursor is) 

EF - Editor Function 

FE = Format Effector 

HT = Horizontal Tabulation 

IN - INtroducer 

PAD = Primary Auxiliary Device 

RD = Received Datastream 

SAD = Secondary Auxiliary Device 

SD = String Delimiter 

VT = Vertical Tabulation 

DA = Qualified Area (defined by DAQ, SPA, EPA) 
rfs - reserved for future standardization 


Abbreviations for Figures 4a through 4e. 
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in t > field(s) 


FT Type Param 



3/00-03 1 

8/04 1 

8/05 1 

8/06 1 

8/07 1 

8/08 1 

8/09 1 

8/10 1 

3/11 1 

3/12 1 

3/13 1 

'714 1 

; 8/15 1 

9/00 
9/01 
9/02 
9/03 
9/04 
9/05 
9/06 
9/07 
9/08-10 
9/11 
9/12 
9/13 
9/14 
9/15 


6/00 2 
6/01 2 
6/02 2 
6/03 2 


(rf s) 

INDex 
NExt Line 

Start of Selected Area 

End of Selected Area 

Horizontal Tabulation Set 

Horiz. Tabu l . with Justification 

Vertical Tabulation Set 

Partial Line Down 

Partial Line Up 

Reverse Index 

Single Shift 2 

Single Shift 3 

Device Control String 

Private Use 1 

Private Use 2 

Set Transmit State 

Cancel CHaracter 

Message Waiting 

Start of Protected Area 

End of Protected Area 

(rfs) 

Control Sequence Introducer 
String Terminator 
Operating System Command 
Privacy Message 
Application Program Command 


Disable Manual Input 
INTerrupt 

Enable Manual Interrupt 
Reset to Initial State 


Final 

FT 

Type 

Param 

Mnem 

6/00 

3a 

FE 

(1) 

HPA 

6/01 

3a 

FE 

(1) 

HPR 

6/02 

3a 


(1) 

REP 

6/03 

3a 


(0) 

DA 

6/04 

3a 

FE 

(1) 

VPA 

6/05 

3a 

FE 

(1) 

VPR 

6/06 

3b 

FE 

(1 ;1) 

HVP 

6/07 

3c 

FE 

(0) 

TBC 


Figure 4a. Controls for Character-Imaging Devices 


Final 

FT 

Type 

Param 

Mnem 

4/00 

3a 

EF 

(1) 

ICH 

4/01 

3a 

EF 

(1) 

CUU 

4/02 

3a 

EF 

(1) 

CUD 

4/03 

3a 

EF 

(1) 

CUF 

4/04 

3a 

EF 

(1) 

CUB 

4/05 

3a 

EF 

(1) 

CNL 

4/06 

3a 

EF 

(1) 

CPL 

4/07 

3a 

EF 

(1) 

CHA 

4/08 

3b 

EF 

(1 ;1) 

CUP 

4/09 

3a 

EF 

(1) 

CHT 

4/10 

3c 

EF 

(0) 

ED 




1 





2 


4/11 

3c 

EF 

(0) 

EL 




2 


4/12 

3a 

EF 

(1) 

IL 

4/13 

3a 

EF 

(1) 

DL 

4/14 

3c 

EF 

(0) 

EF 




1 





2 


4/15 

3c 

EF 

(0) 

EA 




2 


5/00 

3a 

EF 

(1) 

DCH 

5/01 

3c 


(0) 

SEM 




1 





2 





3 


5/02 

3b 


(1;1) 

CPR 

5/03 

3a 

EF 

(1) 

SU 

5/04 

3a 

EF 

(1) 

SD 

5/05 

3a 

EF 

(1) 

NP 

5/06 

3a 

EF 

(1) 

PP 

5/07 

3c 

EF 


CTC 


5/08 

3a 

EF 

(1) 

ECH 

5/09 

3a 

EF 

(1) 

CVT 

5/10 

3a 

EF 

(1) 

CBT 


Insert CHaracter 
CUrsor Up 
Cursor Down 
CUrsor Forward 
CUrsor Backward 
Cursor Next Line 
Cursor Preceding Line 
Cursor Horizontal Absolute 
CUrsor Position 
Cursor Horizontal Tabulation 
Erase in Display 
From AP to end (inclusive) 
From start to AP (inclusive) 
All of display 
Erase in Line 
From AP to end (inclusive) 

: From start to AP (inclusive) 
All of line 
Insert Line 
Delete Line 
Erase in Field 
From AP to end (inclusive) 
From st.art to AP (inclusive) 
All of field 
Erase in Area 
From AP to end (inclusive) 
From start to AP (inclusive) 
All of QA 
Delete CHaracter 
Select editing Extent Mode 
Edit in display 
Edit in AL 
Edit in field 
Edit in_QA 

Cursor Position Report 
Scroll Up 
Scroll Down 
Next Page 
Preceding Page 
Cursor Tabulation Control 
Set HT stop at AP 
Set VT stop at AL 
Clear HT stop at AP 
Clear VT stop at AL 
Clear all HT stops in AL 
Clear all HT stops in device 
Clear all VT stops in device 
Erase CHaracter 
Cursor Vertical Tabulation 
Cursor Backward Tabulation 


Horizontal Position Absolute 
Horizontal Position Relative 
REPeat 

Device Attributes 
Vertical Position Absolute 
Vertical Position Relative 
Horiz. and Vertical Position 
TaBulation Clear 
Clear HT stop at AP 
Clear VT stop at AL 
Clear all HT stops in AL 
Clear all HT stops 
Clear all VT stops 
Set Mode 

Guarded Area Transfer Mode 
Keyboard Action Mode 
Control Representation Mode 
Insertion-Replacement Mode 
Status Reporting Transfer Mode 
ERasure Mode 
Vertical Editing Mode 
(rfs) 

(rfs) 

Horizontal Editing Mode 
Positioning Unit Mode 
Send-Receive Mode 
Format Effector Action Mode 
Format Effector Transfer Mode 
Multiple Area Transfer Mode 
Transfer Termination Mode 
Selected Area Transfer Mode 
Tabulation Stop Mode 
Editing Boundary Mode 
Line feed New line Mode 
Media Copy 
To PAD 
From PAD 
To SAD 
From SAD 

Turn OFF copying RD to PAD 
Turn ON copying RD to PAD 
Turn OFF copying RD to SAD 
Turn ON copying RD to SAD 


Figure 4c. Controls for Character-Imaging Devices 


FT Type Pa ram 


6 / 10-11 
6/12 3c 


(rfs) 

Reset Mode 

(same parameters as SM) 

Select Graphic Rendition j 

Primary rendition 
Sold, or increased intensity 
Faint > decreased intensity, 
or secondary color 
Italic. 

Underscore 

Slow blink. « 2.5/second) 

Rapid blink (> 2.5/second) 

Negative (reverse) image 
(rfs) 

(rfs) 

Primary Font 

1st to 9th alt. font (via FNT) 
Fraktur 

Device Status Report 
Ready, no malfunctions detected 
Busy - retry later 
Busy - DSR will notify ready 
Malfunction - retry 
Malfunction — DSR will notify ready 
Please report status (DSR or DSC) 
Please report AP via CPR 
Define Area Qualification 
Accept all input 
Accept no input (protected); 

do not transmit (guarded) 

Accept graphics 
Accept numerics' 

, Accept a Iphabetics 
Right justify in area 
Zerofi It in area 

HT stop at start of area (field) 
Accept no input (protected); 

permit transmit (unguarded) 
Spacefill in area 


Figure 4b. Controls for Character-Imaging Devices 


Figure 4d. Controls for Character-Imaging Devices 
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Code Symbol 


Code Symbol 


Flnal - F T Type Param 


Mnem Name 


4/00 

4/01 

4/02 

4/03 

4/04 


4a 

4a 

4b 

4a 

4b 


£F 

EF 

FE 

FE 

FE 


CD SL 

CD SR 

(100;100)GSM 
GSS 


C0;0) 
C0;0) 
1 ;0 


FNT 


Scroll Left 
Scroll Right 

Graphic Size Modification 
Graphic Size Selection 
FoNT selection 
Primary font 
First alternative font 


4/05 

4/06 


4a 

4c 


9;0 


10/00 
10/01 
10/02 
10/03 
10/04 
'10/05 
10/06 
10/07 
10/08 
10/09 
10/10 
1 10/11 
10/12 
10/13 
[10/14 
10/15 


CO) 

1 


TSS 

JFY 


4/07 

4/08 


4b 

4c 


(0) 

1 

2 

3 

4 

5 


SPI 

QUAD 


Ninth alternative font 
Thin Space Specification 
JustiFY 

Terminate all justify actions 
Fill action ON 
(text to/from other lines) 
Interword spacing 
Letter spacing 
Hyphenation 
Flush left margin 
Center text between margins 
Flush right margin 
Italian form (underscore last) 
SPacing Increment 
Quad 

Flush left 

Flush left, fill with leader 
Center 

Center, fill with leaoer 
Flush right 

Flush right, fill with leader 


Csame as 02/00) 
Opening double quote 
Closing double quote 
Club suit 
Diamond suit 
Heart suit 
Spade suit 

Closing single quote 

Is implied by 

Impli es 

Mult ip ly 

Plus or minus 

Nabla, or del 

Em dash 

Radix point 

Divi de 


11/00 

11/01 

11/02 

11/03 

11/04 

11/05 

11/06 

11/07 

11/08 

11/09 

11/10 

11/11 

11/12 

11/13 

11/14 

11/15 


Figure 4e. Controls for Character-Imaging Devices 


12/00 
12/01 
1 12/02 
12/03 
12/04 
| 1 2/05 
12/06 
[12/07 
ll 2/08 
12/09 
ll 2/10 
12/11 
ll 2/12 

12/13 

12/14 

12/15 


Large circle 
Dagger 

Superior (superscript) 2 
Superior (superscript) 3 
Rectangle 
Parallel 

Partial derivative 
Lower left corner, floor 
Upper left corner, ceiling 
Upper right corner 
Lower right corner 
Perpendicular 
Less than or equal 
Not equal, other than 
Greater than or equal 
Paragraph mark, pilcrow 


Section mark 
Double dagger 
Dot bullet 
Capital theta 
Capital delta 
At least one exists 
Capital phi 
Capital gamma 
Upward arrow 
Right arrow 
Dot product 
Degree 

Capital lambda 
Regi ster 
Copyright mark 
Capital xi 


13/00 

13/01 

13/02 

13/03 

13/04 

13/05 

13/06 

13/07 

13/08 

13/09 

13/10 

13/11 

13/12 

13/13 

13/14 

13/15 


Capital pi 
Capital psi 
Square bullet 
Capital sigma 
Integral 
Capital upsi Ion 
Therefore 
Capital omega 
Downward arrow 
Left arrow 
Approximately equal 
Opening angular bracket 
Logi cal AND 

Closing angular bracket 

Logical NOT 

Infinity 


14/00 
14/01 
14/02 
14/03 
14/04 
14/05 
14/06 
14/07 
14/08 
14/09 
14/10 
14/11 
14/12 
14/13 
14/14 
1 4/15 


Opening single quote 

Small alpha 

Small beta 

Small theta 

Small delta 

Sma l l epsi Ion 

Small phi 

Small gamma 

Small eta 

Small iota 

Identically equivalent 

Small kappa 

Small lambda 

Sma l l mu 

Small nu 

Small xi 


15/00 

15/01 

15/02 

15/03 

15/04 

15/05 

15/06 

15/07 

15/08 

15/09 

15/10 

15/11 

15/12 

15/13 

15/14 

15/15 


Sma 1 1 pi 
Small psi 
Small rho 
Small sigaa 
Small tau 
Sma 1 1 upsi Ion 
Check nark, radical mark 
Small omega 
Small chi 

Logical universal quantifier! 
Small zeta 
Cap intersection 
Logical OR 
Cup, union 
Ove rbar 
(same as 7/15) 


Table 3. Names of the Additional Graphics, 8-bit Set 
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Figure 5. 8-bit ASCII Proposal 
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rscript) 2 
rscr' ) 3 


ti ve 

ner, floor 
ner, ceiling 
rner 


equa' 
r br. et 


a l quantifier] 


8-bit Set 


CODE EXPANSION 

We have seen how ASCII was extended by making 
many related pages of the 7-bit code. It is also possible 
to expand ASCII into an 8-bit code, or even 9-bit and 
10-bit if we wished, for that matter. But an 8-bit code is 
obviously the most logical one to concentrate on, and 
this has been under development for several years. 

The proposed 8-bit Expanded ASCII Code is shown in 
Figure 5. The identification of the graphic symbols is 
given in Table 3. I 

One can observe many interesting things about this 
set. For example, it has the entire Greek set of small let- 
ters except for “omicron", with eleven capitals to go 
with others from the Roman capitals to complete the 
Greek set. But apparently the committee didn’t follow 
646-031. the Greek alphabet mentioned in Table 1. They 
didn’t use the customary ordering “alpha-beta-gamma , 
the way we learn our “a-b-c's". I suppose it is argued 
'bat this set will never be used for language, only math 
symbols. And 646-027, shown in Figure la, does not de- 
mand the special capital ''upsilon” shown in position 
13/5. If the Greeks can agree to using a Roman capital 
“Y" for upsilon, could the Americans? 

You'll notice some math symbols, but not enough for 
APL. In fact, the whole set seems highly slanted to mathe- 
matics. rather than business. Of course there are the 
four corner symbols for forms. Presumably the card 
suits will strike your eye. and you will wonder why so 
many other useful symbols were ignored in favor o 
these. Don't worry, they will always come in handy; it s 
sometimes useful to have symbols whose meaning you 
can reassign without harm to programming languages, 
etc. The committee were obviously bridge players, for 
spades collate high. 

This proposal has not had real public scrutiny yet, and 
it must be considered no more than a proposal. Presum- 
ably X3 will agree about July that it should be sent out 
for formal public review and letter ballot. My guess is 
that it will not be adopted in just the form you see here. 

FUTURE FOR ASCII 

The methods are in place for codifying all symbols 
that people use. They may be language alphabets, signs, 
drawing symbols, or controls for equipments. Robots, 
for example. Satellites are augmenting conventional tele- 
communications systems, so that one can borrow cheaply 
and permanently from electronic libraries. 

To prepare for this, other sets are being developed for 
registry, many through ISO Technical Committee 46/1, 
Automated Documentation. A 2-page mathematical 
symbol set is near submission, as are African sets. 
Work is started for Arabic, which will take about 5 sets 
to handle fully, although there is a commercial subset of 
94 graphics. Another Cl set is being proposed for biblio- 
graphic controls. It contains four types — annotation 
controls, filing controls, reference controls, and subject 
designators. Other Cl sets can come from process con- 
trol, animation and other graphics applications, etc. 

West Germany has proposed a new ISO project on text 
communication, to harmonize teleconnection of the more 
than one hundred varieties of typewriters (and key- 
boards) throughout the world. The extension method ot 
multiple 7-bit codes is ideal for this (8-bit codes imply too 
many keys or shift combinations for people to use easily. 

I am convinced that microcomputer users are going to 
develop some fantastic applications that will become 
widespread enough for their special graphic and control 
sets to be registered. How about a control set or two for 
sewing machines? 

In fact, it is very difficult to think of any general applica- 
tion where one could not find a usage for these regis- 
tered variants and extensions. Q 
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date 1978 August 8 


phone 8-357-2569 maiuzone C61 


COPIES 


to LW (Roy) Beers 


from Berner 


tmponent Advanced Systems Engineering 
subject ASCII - SOME BACKGROUND 


Warning! The information here may be more than, or not what, 
you want to know about ASCII. 


ASCII has been the American Standard Code for Information 
Interchange since 1963. It has been (identically) the 
International Standard Code since 1973. For the US Govern- 
ment it is also Federal Information Processing Standard 1, 
per signature of Lyndon B Johnson in 1968. It is Alphabet 5 
for the CCITT, the ruling body for international communication. 
It is a state standard for the Russians and the Japanese, in 
conjunction with their own Cyrillic and Katakana. 

Why then is it not universally used? 

ORIGINS 



% 

-A 



In 1958, I was in charge of Logical System Standards for IBM. 
Prior to that time most IBM systems used a 6 -bit code, which, 
permitted only upper case (capital) alphabet. ALGOL was coming 
along, as were many other new developments demanding lower 
case as well. Stretch had an, 8-bit character in a 64-bit word 
(I was at Los Alamos with Fred Brooks in 1958 on this matter). 


For the 8000 (later 360) my group was developing a coalescence 
of the existing nine (!) different BCD codes , 8-bits wide, and 
compatible on all media as well as internally. Word got out, 
and I was invited by John Gosden (now VP Equitable Life and 
adviser to the President's OIP) to show the work to the British 

Standards Institution, which had an advanced working party on 
these matters. H. McGregor Ross was the chief worker. I was 
also asked to show it to an EIA TR committee (which was where I 
first met Eric Clamons). 

At this time, IBM spearheaded a movement to convert the Office 
Equipment Manufacturers Association (a moribund operation) into 
BEMA. The primary purpose was to control the inevitable movement 
to interchange standards. I wrote the draft scope and program of 
work for X3 . - 
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John Auwaerter of Teletype, who developed the Models 33, 35, 
37 and ink-jet printer, participated heavily in code work 
under ATT auspices. Basically the code ended up very much 
like my original proposals (I take some pride that it was 
for a time called the Bemer-Ross code in Europe). 

IBM ACTIONS 


The 360 was, as everyone knows, announced prematurely. Fred 
Brooks (of Brooks, Amdahl, and BlaauW — 360 designers) told 
me that the printers and card punches were not ready in ASCII, 
and therefore Learson (ex-Chairman of the IBM Board) made the 
decision for EBCDIC — Extended BCD Interchange Code. Presumably 
the PI bit was there so that the 360 could run in ASCII, but 
the software to do so never got implemented. Those familiar 
with the difficulties in getting the 360 to work will appreciate 
this. 

To this day IBM has never released EBCDIC for standardization 
(which it could under the code extension and registry provisions 
of ISO). It is a proprietary code, and IBM states publicly that 
they alone reserve the right to modify it. 

There are several different codes still in IBM equipment! The 
System 3 code, for example, is an (absolutely incompatible) 
permutation of the 360 EBCDIC. I have not checked System 1 yet. 
IBM obviously intends to handle all such conversions with hard 
chips, and let the competitors figure it out some way. It’s by 
now an integral part of IBM's anti -plug-compatible strategy. 

HIS HISTORY 

The 600/6000/66 evolutionary line is based on a near copy of the 
pre-360 IBM 7094. It has a 36 -bit word of 6 characters of 6 bits 
each (BCD) . 

The GE Bull M-140 (circa 1966) was designed internal ASCII. But 
GE cancelled it, and eventually mounted a new-line effort under 
Richard Bloch. He wanted to confront IBM all the way with a 
compatible CPU like RCA did. I worked for Bloch, but knew about 
the RCA efforts on the inside due to a friend of my wife's (a 
story for my memoirs!). Therefore I wrote a series of memos on 
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continuing the 600 architecture instead (they make interesting 
reading even now ) 0 With the help of many (Walker Dix included) 
this came to be. As far as I know, I was the only internal GE 
employee on the study teams to sell the computer business. 

With the Level 64, however, Honeywell Bull began to push EBCDIC 
strongly. I believe that Spangle concurred. But Phoenix has 
never been able to implement this because of the investment in 
software, particularly timesharing software. In other words, 
we live with BCD in batch and ASCII in timesharing. 
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HIS NOW 

The ASCII software uses a 9-bit byte, 4 to a word, even though 
ASCII is principally 7 bits (but going on 8 according to 
international work in expanding the standard) 0 Level 6 is 
completely ASCII. 

The product calendar apparently contains a project to make a 
completely ASCII version of the L66 system. I don't know if it's 
software only, or a hardware change as well. If hardware also, 
it should be a 2-step change — software first, not concurrently. 

One motivation for this is that new and different software is 
salable and rentable, whereas the old is not. Another is net- 
working with Level 6. This will not be an easy project. Our 
current GCOS III is ill-structured and overly patched already, 
and no one has been willing so far to spend any money to make 
it healthy. Code relationships are implicit in the programs, 
but nowhere documented. It's rewrite and retest all the way I 


EXTERNAL FACTORS 

Teletype now accommodates EBCDIC as well as ASCII in their 
terminals. They do so to make a buck in the IBM environment, as 
many do. ASCII is the code for "clear” traffic, but the X. 25 
protocol permits "private" traffic and thus EBCDIC 6 

IBM has -stated loudly and clearly that they will use codes of 
many more bits than 8 for future systems. The fibre optic 
interface gives us this clue also. They know that EBCDIC is 
too ill -structured to expand, whereas ASCII is designed for just 
such purposes. They are prepared to move to ASCII. Such a move 
would jolt the add-on people terribly. 
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The microcomputer world is all ASCII. Of course HIS may 
never use them in its systems, but it might be wise not to 
preclude the option. 
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Computing 
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October 1979 


The bibliography contains nearly 750 annotated 
and indexed citations to papers, books, and other 
items relating to the origins of the modem 
electronic computer. The topics covered range from 
early digital calculating devices and mechanical 
automata to the first stored program computers. 
New entries were added up to June, 1979. 

Keywords and phrases: calculating machines, 
sequence control mechanisms, cryptanalytic 
devices, punched card machinery, 
electromechanical and relay calculators, stored 
program electronic computers 
CR categories: 1.2, 6.21, 6.4 


Introduction 

This bibliography is a revised and considerably 
extended version of that appearing in The Origins 
of Digital Computers: Selected Papers (Ed. B. Ran- 
dell: Springer Verlag (1973)). It concentrates on 
the period from Babbage to the invention of the 
stored program electronic digital computer in the 
mid- 1940s, but also contains items relating to 
earlier work on mechanical calculation and on 
devices incorporating sequence control mech- 
anisms. The bibliography covers events up to 
about 1949 when the earliest stored program 
computers became operational, but no attempt 
has been made to cover the explosive growth in 
the literature on electronic computers which oc- 
curred after 1946, when a large number of com- 
puter projects were started in the United States 
and elsewhere based directly on the plans for the 
EDVAC or the Institute for Advanced Study 
computers. All the items listed have been in- 
spected, sometimes rather cursorilv. by the au- 
thor. (Some items which are still being sought are 
listed in Appendix 1, as a challenge to interested 
readers.) 

The bibliography is listed in alphabetical order 
by author; anonymous items are listed last. In 
order to facilitate use of the index, when more 
than one publication is listed for a given author 

© 1979, American Federation of Information Processing 
Societies, Inc. Permission to copy without fee all or part of 
this material is granted provided that copies are not made or 
distributed for direct commercial advantage, the AFIPS copy- 
right notice and the title of the publication and its date appear, 
and notice is given that copying is by permission of the Ameri- 
can Federation of Information Processing Societies. Inc. To 
copy otherwise, or to republish, requires specific permission. 

Author’s address: Computing Laboratory, The University 
of Newcastle upon Tyne, Claremont Tower, Claremont Road, 
Newcastle upon Tyne NE1 7RU, England. 

© 1979 AFIPS 0164-1239/79/020101-107 $02.00/0 


the name of the author is followed bv the year of 
publication in parentheses. Where there is more 
than one publication in one year, the year is 

followed by a, b Although the index is fairly 

detailed, readers are warned to check the more 
general works on a given subject, as well as those 
explicitly listed under a particular index entry. 

In the remainder of this introduction, a brief 
survey is attempted of some of the more inter- 
esting groups of items which are new entries in the 
bibliography, with special attention given to those 
which add significantly to the information in the 
set of chapter introductions in The Origins of Digital 
Computers. (The annotations with the recently 
added items are, in general, more detailed than 
those for items which were in the original 
bibliography.) 

Brief comments are also given on some of the 
larger bodies of historical material which particu- 
larly merit study by historians of science. One 
such body of material, that arising from the 
so-called ENIAC patent suit, is described at some 
length in Appendix 2. Another large collection of 
technical and patent literature, covering broadly 
similar ground, is that donated by G. R. Stibitz to 
Dartmouth College Library (see Anon (1973b)). 
Also worthy of explicit mention is the well- 
cataloged and indexed collection of about 500 of 
K. Zuse’s technical papers, letters, patents, etc. 
held at the Gesellschaft fur Mathematik und 
Datenverarbeitung, Bonn. In general, however, 
the patent literature is not covered in the present 
bibliography. It is to be hoped that before long 
someone with the necessary specialist skills and 
experience will undertake the task of surveying 
this literature for material relating to the origins 
of digital computers. 

General Works 

In the last few years there has appeared a number 
of very useful books on the history of calculators 
and computing. The best known, at least in the 
English speaking world, is one from The Office 
of Charles and Ray Eames, edited bv G. Fleck, 
entitled A Computer Perspective. Quoting from the 
review by D. E. Knuth in Historia Mathematical 
“The style of presentation is aimed at a popular 
audience and it is a fascinating wav to present 
history; it depicts the spirit of the times, giving 
isolated details rather than stating a large number 
of organized facts. But it is also useful as a guide 
for scholarly research, since the picture credits 
and index at the rear of the book are quite 
complete.” 
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SECTION 1 


GENERAL 


Recommendation V.l 


equivalence between binary notation symbols 

AND THE SIGNIFICANT CONDITIONS OF A TWO-CONDITION CODE 
(New Delhi, I960: amended at Geneva. 1964 and 1972) 


Binary numbering expresses numbers by means of two digits normally represented by the symbols 0 and 1. 
Transmission channels are especially well suited to the transmission of signals by a modulation having two 
signiticant conditions (two-condition modulation). These two significant conditions are sometimes called “space” 
and 'mark" or "start” and “stop”, or they may be called condition A or condition Z [1], 

It is very useful to make the two conditions of a two-condition modulation correspond to the binary 
digits 0 and 1. Such equivalence will facilitate the transmission of numbers resulting from binary calculation, the 
conversion of codes for binary numbers ahd of codes for decimal numbers, maintenance operations and relations 
between transmission personnel and the personnel in charge of data-processing machines. 

At first sight, it does not seem to matter whether the symbol 0 corresponds in transmission to condition A 
or condition Z. the symbol 1 then corresponding to condition Z or condition A or vice versa. 

In telegraphy, however, when a telegraphic communication is set up and the sending of signals is stopped 
(called the idle condition of the line), the signal sent over the line consists of condition Z throughout the 
suspension of transmission. 

It is logical (and for certain VF telegraph systems also essential) to use the same rule in data transmission. 
During the “idle periods” of transmission, condition Z shoulcj,be applied to the circuit input. 

Data transmission on a circuit is often controlled by perforated tape. On perforated tapes used for 
telegraphy, condition Z is represented by perforation. When binary numbers are represented by means of 
perforations, it is custorhary to represent the symbol 1 by a perforation. It is therefore logical to make this 
symbol 1 correspond to condition Z. 


For these reasons, the CCITT 


unanimously declares the following view: 

1 In transmitting data by two-condition code, in which the digits are formed using binary notation, the 

symbol 1 of the binary notation will be equivalent to condition Z of the modulation, and the symbol 0 of the 

binary notation will be equivalent to condition A of the modulation. 

2 During periods when there is no signal sent to the input of the circuit, the circuit input condition is 

condition Z. 
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3 If perforation is used, one perforation corresponds to one unit interval under condition Z. 

4 In accordance with Recommendation R.3I, the sending of symbol 1 (condition Z) corresponds to the tone 
being sent on a channel using amplitude modulation. 

5 In accordance with Recommendation R.35, when frequency modulation is used, the sending of symbol 0 
corresponds to the higher frequency, while the sending of symbol 1 corresponds to the lower frequency. 

6 a) For phase modulation with reference phase: 

the symbol 1 corresponds to a phase equal to the reference phase; 

the symbol 0 corresponds to a phase opposed to the reference phase. 

b) For differential two-phase modulation where the alternative phase changes are 0 degree or 
1 80 degrees: 

the symbol I corresponds to a phase inversion from the previous element; 
the symbol 0 corresponds to a no-phase inversion from the previous element. 

7 A summary of equivalence is shown in Table 1/V.l. 



TABLE 1/V.l 

Summary of equivalence (see Note I ) 



Digit 0 

Digit 1 


’’Start” signal in start-stop code 

Line available condition 
in telex switching 
"Space" element of start-stop code 
Condition A 

"Stop" signal in start-stop code 

Line idle condition 
in telex switching (Note 2) 

"Mark” element of start-stop code 
Condition Z 

Amplitude modulation 

Tone-off 

Tone-on 

Frequency modulation 

High frequency 

Low frequency 

Phase modulation with 
reference phase 

Opposite phase to the reference phase 

Reference phase 

Differential two-phase 
modulation where the 
alternative phase changes 
are 0 degree or 1 80 degrees 

No phase inversion 

Inversion of the phase 

Perforations 

No perforation 

Perforation 


Note 1 - The standardization described in this Recommendation is general, whether over telegraph-type circuits or over circuits of 
the telephone type, making use of electromechanical or electronic devices. 

Note 2 - It primarily applies to anisochronous use. 


Reference 

[1] CCITT Definition: Position A ; position Z, Vol. X, Fascicle X.l (Terms and Definitions). 
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Recommendation V.3 


INTERNATIONAL ALPHABET No. 5 


(Mar del Plata, 1968; amended at Geneva, 1972) 


Introduction 


A seven-unit alphabet capable of meeting the requirements of private users on leased circuits and of users 
of data transmission by means of connections set up by switching on the general telephone network or on 
telegraph networks has been established jointly by the CCITT and the International Organization for Standardiza- 
tion (ISO). 

This alphabet - Alphabet No. 5 - is not intended to replace Alphabet No. 2. It is a supplementary 
alphabet for the use of those who might not be satisfied with the more limited possibilities of Alphabet No. 2. In 
such cases it is considered as the alphabet to be used as common basic language for data transmission and for 
elaborated message systems. 


needs. 


Alphabet No. 5 does not exclude the use of any other alphabet that might be better adapted to special 


1 Scope and field of application 

1.1 This Recommendation contains a set of 128 characters (control characters and graphic characters such as 
letters, digits and symbols) with their coded representation. Most of these characters are mandatory and 
unchangeable, but provision is made for some flexibility to accommodate special national and other requirements. 

1.2 The need for graphics and controls in data processing and in data transmission has been taken into 
account in determining this character set. 

1.3 This Recommendation consists of a general table with a number of options, notes, a legend and 
explanatory notes. It also contains a specific International Reference Version, guidance on the exercise of the 
options to define specific national versions and application oriented versions. 

1.4 This character set is primarily intended for the interchange of information within message transmission 
systems and between data processing systems and associated equipment. 

1.5 This character set is applicable to all Latin alphabets. 

1.6 The character set includes facilities for extension where its 128 characters are insufficient for particular 
applications. 

1.7 The definition of some control characters in this Recommendation assumes that data associated with them 
is to be processed serially in a forward direction. Their effect when included in strings of data which are processed 
other than serially in a forward direction or included in data formatted for fixed record processing may have 
undesirable effects or may require additional special treatment to ensure that the control characters have their 
desired effect. 


Implementation 


2.1 This set should be regarded as a basic alphabet in an abstract sense. Its practical use requires definitions of 
its implementation in various media. For example, this could include punched tapes, punched cards, magnetic 
tapes and transmission channels, thus permitting interchange of data to take place either indirectly by means of an 
intermediate recording in a physical medium, or by local electrical connection of various units (such as input and 
output devices and computers) or by means of data transmission equipment. 

2-2 The implementation of this coded character set in physical media and for transmission, taking also into 
account the need for error checking, is the subject of ISO publications. 
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Basic code table 

TABLE I/V.3 


Basic code 


[assajial 

mmBi 


0 I 1 I 2 I 3 


0 


5 6 


TC,| S 

COLE)! 


ill I TC, DC, 

I M(SQH) I 


0 2 I TC, DC, 

■ CSTX) 


SDSI 
ODQDI 
0000 
boo 
0000 
00001 


1 3 I TC, DC, 

■ CETX) | 


C, I DC, 

0 T ) | 


TC, TC. I % 

(ENO) (NAK)I 


TC. TC. & 

(ACK)| (SYN)O 


belItcJ ' 

I CE T B ) I @ 


can! ( I 8 I H 


EM I ) 


SUB 


1 4i so I is, 

■ |(RS) 


1 5| si is, 

■ I (US) 
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Notes about Table 1/V.3: 



Note I - The Format Effectors are intended for equipment in which horizontal and vertical movements are effected separately If equip- 
ment requires the action ol carriage return to be combined with a vertical movement, the Format Effector for that vertical movement 
may be used to effect the combined movement. For example, if new line (symbol NL. equivalent to CR + LF) is required FE shall be 
used to represent it. This substitution requires agreement between the sender and the recipient of the data. ' 1 

The use of these combined functions may be restricted for international transmission on general switched telecommunication networks 
(telegrapn and telephone networks). 

Note 2 - The symbol £ is assigned to position 2/3 and the symbol $ is assigned to position 2/4. In a situation where there is no requirement 
for the symbol £ the symbol # (number sign) may be used in position 2/3. Where there is no requirement for the symbol $the symbol ££ 
(currency sign) may be used in position 2/4. The chosen allocations for symbols to these positions for international information interchange 
should be agreed between the interested parties. It should be noted that, unless otherwise agreed between sender and recipient, the svmbols 
£. S or n do not designate the currency of a specific country. 

Xmc S - National use positions. The allocation of characters to these positions lies within the responsibility of national standardization 
bodies. These positions are primarily intended for alphabet extensions. If they are not required for that purpose, thev may be used for svm- 
bols. ' J 

Mate 4 - Positions 5/14, 6/0 and 7/14 are provided for the symbols upward arrow head, grave accent and overline However these 
positions may be used for other graphical characters when it is necessary to have 8. 9 or 10 positions for national use. 

Now 5 - Position 7/14 is used for the graphic character -ioverlinei. the graphical representation of which may vary according to national 
use to represent itildei or another diacritical sign provided that there is no risk of confusion with another graphic character included in 
the table. 

Sole 6 - The graphic characters in positions 2/2. 2. 7. 7 12 and 5/14 have respectively the significance of quotation mark, apostrophe 
comma and UPW ARD ARROW HEAD: however, these characters take on the significance of the diacritical siens diaeresis, acute accent! 
cedilla and circumflex accent when they are preceded or followed by the backsp ace character (0/8). 



4 Legend 

4. 1 Control characters 


Abbreviation 

Note of 

Table 1/V.3 

Meaning 

Position in the 
code table 

ACK 


Acknowledge 

0/6 

BEL 


Bell 

0/7 

BS 


Backspace 

0/8 

CAN 


Cancel 

1/8 

CR 

i 

Carriage return 

0/13 

DC 


Device control 

_ 

DEL 


Delete 

7/15 

DLL 


Data link escape 

1/0 

EM 


End of medium 

1/9 

ENQ 


Enquiry 

0/5 

EOT 


End of transmission 

0/4 

F.SC 


Escape 

1/1 1 

ETB 


End of transmission block 

1/7 

ETX 


End of text 

0/3 

FE 


Format effector 


FF 

i 

Form feed 

0/12 

FS 


File separator 

1/12 

GS 


Group separator 

1/13 

HT 


Horizontal tabulation 

0/9 

IS 


Information separator 

_ 

LF 

i 

Line feed 

0/10 

NAK 


Negative acknowledge 

1/5 

NUL 


Null 

0/0 

RS 


Record separator 

1/14 

SI 


Shift-in 

0/15 

SO 


Shift-out 

0/14 

SOH 


Start of heading 

0/1 

SP 


Space (see (j 7.2) 

2/0 

STX 


Start of text 

0/2 

SUB 


Substitute character 

1/10 

SYN 


Synchronous idle 

1/6 

TC 


Transmission control 


US 


Unit separator 

1/15 

VT 

i 

Vertical tabulation 

0/11 
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4.2 Graphic characters 


Graphic 

Notes of 

Table 1/V.3 

Name 

Position in the 
code table 

(space) 


Space (see §7.2) 

2/0 



Exclamation mark 

2/1 


6 

Quotation mark, Diaeresis 

2/2 

£ 

2 

Pound sign 

2/3 

# 

2 

Number sign 

2/3 

$ 

2 

Dollar sign 

2/4 

U 

2 

Currency sign 

2/4 

% 


Percent sign 

2/5 

& 


Ampersand 

2/6 


6 

Apostrophe, Acute accent 

2/7 

c 


Left parenthesis 

2/8 

> 


Right parenthesis 

2/9 



Asterisk 

2/10 

+ 


Plus sign 

2/11 

, 

6 

Comma, Cedilla 

2/12 

- 


Hyphen, Minus sign 

2/13 



Full stop (period) 

2/14 

/■ 


Solidus 

2/15 



Colon 

3/10 

; 


Semi-colon 

3/11 

< 


Less-than sign 

3/12 

= 


Equal sign 

3/13 

> 


Greater- than sign 

3/14 



Question mark 

3/15 


4,6 

Upward arrow head. Circumflex accent 

5/14 

- 


Underline 

5/15 


4 

Grave accent 

6/0 


4, 5 

Overline, Tilde 

7/14 


5 Explanatory notes 

5.1 Numbering of the positions in Table I/V.3 

Within any one character the bits are identified by b 7 , b 6 . . . b„ where b 7 is the highest order, or most 
significant bit, and b, is the lowest order, or least significant bit. If desired these may be given a numerical 
significance in the binary system, thus: 

Bit identification: b 7 b 6 b 5 b 4 b 3 b, b, 

Significance: 64 32 16 8 4 2 1 

In the table the columns and rows are identified by numbers written in binary and decimal notations. 

Any one position in the table may be identified either by its bit pattern, or by its column and row 
numbers. For instance, the position containing the digit 1 may be identified: 

— by its bit pattern in order of decreasing significance, e.g. 011 0001 

— by its column and row numbers, e.g. 3/1. 

The column number is derived from bits b 7 , b 6 and b 5 giving them weights of 4, 2 and 1 respectively. The 
row number is derived from bits b 4 , .b^, b 7 and b t giving them weights of 8, 4, 2 and 1 respectively. 


" Order of transmitting bits is not necessarily the same as shown here. For the order of the transmission of bits, see I in 
Recommendation V.4 or X.4 [1], 



] 

, 
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Diacritical signs 


In the character set, some printing symbols may be designed to permit their use for the composition of 
accented letters when necessary for general interchange of information. A sequence of three characters, comprising 
a letter BACKSPACE and one of these symbols, is needed for this composition, and the symbol is then regarded 
as a diacritical sign. It should be noted that these symbols take on their diacritical significance when they are 
preceded or followed by one BACKSPACE character; for example, the symbol corresponding to the code 
combination 2/7 normally has the significance of APOSTROPHE, but becomes the diacritical sign ACUTE 
ACCENT when it precedes or follows a BACKSPACE character. 


In order to increase efficiency, it is possible to introduce accented letters (as single characters) in the 
positions marked by Note 3 in Table 1/V.3. According to national requirements, these positions may contain 
special diacritical signs. 


Sames. meanings and fonts of graphic characters 


This Recommendation assigns at least one name to denote each of the graphic characters displayed in 
Tables 1/V.3 and 2/V.3. The names chosen to denote graphic characters are intended to reflect their customary 
meanines. However, this Recommendation does not define and does not restrict the meanings of graphic 
characters. Nor does it specify a particular style or font design for the graphic characters. 


Under the provision of Note 3 of Table 1/V.3, graphic characters which are different from the characters 
of the international reference version may be assigned to the national use positions. When such assignments are 
made, the graphic characters should have distinct forms and be given distinctive names which are not in conflict 
with anv of the forms or the names of any of the graphic characters in the international reference version. 


Uniqueness of character allocation 


A character allocated to a position in Table 1/V.3 may not be placed elsewhere in the table. For example, 
in the case of position 2/3 the character not used cannot be placed elsewhere. In particular the POUND sign (£) 
can never be represented by the bit combination of position 2/4. 


V ersions of Table 1/V.3 


6.1 General 


6.1.1 In order to use Table t/V.3 for information interchange, it is necessary to exercise the options left open, 
i.e. those affected by Notes 2 to 5. A single character must be allocated to each of the positions for which this 
freedom exists or it must be declared to be unused. A code table completed in this way is called a version. 


6.1.2 The Notes to Table 1/V.3, the Explanatory Notes and the Legend apply in full to any version. 


International reference version 


This version is available for use when there is no requirement to use a national or an application-oriented 
version. In international information processing interchange the international reference version (Table 2/V.3) is 
assumed unless a particular agreement exists between sender and recipient of the data. 
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The following characters are allocated to the optional positions of Table 1/V.3: 


Number sign 

2/3 

Currency sign 

2/4 

Commercial at 

4/0 

Left square bracket 

5/11 

Reverse solidus 

5/12 

Right square bracket 

5/13 

Left curly bracket 

7/11 

Vertical line 

7/12 

Right curly bracket 

7/13 

CCITT-43340 


It should be noted that no substitution is allowed when using the international reference version 


6.3 


National versions 


6J.1 The responsibility for defining national versions lies with the national standardization bodies. These bodies 
shall exercise the options available and make the required selection. 

6 h 3 ,'n J f S ° r ® q “' re< |' m ° re J tt ’ an one nationai version can be defined within a country. The different versions 
sha I be separately identified. In particular when tor a given national use position, e.g. 5/12 or 6/0 alternative 
characters are required, two different versions shall be identified, even if they differ only'by this single’cha™ 

6J.3 If there is in a country no special demand for specific characters, it is stronglv recommended that the 
characters ot the international reference version be allocated to the same national use positions. 


6.4 


Application-oriented versions 


Within national or international industries, organizations or professional groups, application-oriented 

ontm°n7 ., Can | be USed . The y [ e * u ' re P recise agreement among the parties concerned, who will have to exercise the 
options available and to make the required selection. 


7 Functional characteristics related to control characters 

needed iTLe'r' ^ be ' 0W , T Sta,ed general terms and more ex P licit definitions of use may be 

explicit definmJnTaI!d P e h mentatl f n h 0f ' h t COde ' ab ' e 00 record,n 8 media or on transmission channels. These more 
explicit definitions and the use of these characters are the subject of ISO publications. 


7.1 


General designations and control characters 


The general designation of control characters involves a specific class 
number. 


name followed 


by a subscript 


They are defined as follows: 


i 


TC — Transmission control characters 

Control characters intended to control or facilitate transmission of information over telecommunica- 
tion networks. 

The use of the TC characters on the general telecommunication networks is the subject of ISO* 
publications. J 

The transmission control characters are: 

ACK, DLE, ENQ, EOT, ETB, ETX, NAK, SOH, STX and SYN. 
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FE 


Format effectors 


o 

C\J 


-S c p~^t sisiss* irr ion r 

rrr»S,t;“ ” imerP "" d ” d “ pl *» d '“- T1 * den niiions’of fo™,, S„° 


a) 

b) 

c) 


a page is composed of a number of lines of characters; 
the characters forming a line occupy a number of positions called character positions; 


the active position is that character position in which the character about to be processed would 

SET “ P,i "“ d - The MiVe P “ i,l “ d 0 r,““ 


The format effector characters are: 

BS, CR, FF, HT, LF and VT (see also Note 1 to Table 1/V.3). 


DC — Device control characters 


Control characters for the control of a local or remote ancillary device (or devices) connected to , 
data processing and/or telecommunication system. These control oharaot* 0 a 

control telecommunication systems; this should be achieved by the use of TCs.^ ** mtended t0 


Certain preferred uses of the individual DCs are given in § 7.2 below. 


IS 


— Information separators 


Control characters that are used to separate and qualify data logically There are four cnch oh 
meaning^depenc^o^th'eir applications? 3 ' ^ °' hi ^=' latter case their spe^Hc 


When they are used hierarchically, the ascending order is; 
US, RS, GS, FS. 




7.2 Specific control characters 


- * - - 




ACK — Acknowledge 


A transmission control character transmitted by a receiver as an affirmative response to the sender. 


BEL - Bel! 


a A ,te C n 0 ttn 0l de C v h ic r e a s Cter ^ ther6 “ 3 need t0 Ca " for attention = !t "«* control alarm or 


BS — Backspace 


A format effector which moves the active position one character position backwards on the same line. 
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CAN 


Cancel 


A character or the first character of a sequence, indicating that the data preceding it is in error As a 
result this data ,s to be ignored. The specific meaning of this character must be defined for each 
application and/or between sender and recipient. 


CR — Carriage return 


A format effector which moves the active position to the first character 


position on the same line. 


Device controls 


DC1 


DC2 - 


A device control character which is primarily intended for turning on or starting an ancillary device 

(see a so DO and DO 1 ^ '**** 10 reS '° re 8 deviCe t0 the basic mode of operation 

(see also DC 2 and DC3), or for any other device control function not provided by other DCs. 

ff i1 e i V s iC n e n, C °ren r01 h f ° r tumin § on ° r * a ™g a " ancillary device. 

Which r nr! red f0r H hlS purp0se - 11 may be used t0 set a device to a special mode of operation (in 
which case DCI is used to restore the device to the basic mode), or for anv other device control 
function not provided by other DCs. ' ucv.ce control 


DC3 


A device control character which is primarily intended for turning off or stopping an ancillary device 
This function may be a secondary level stop. e.g. wait, pause, stand-by or halt (in which case DCI is 
sed to restore normal operation). II it is not required for this purpose, it mav be used for anv other 
device control function not provided by other DCs. 


DC4 - 


tc d N V,Ce a 0 "'' 0 ' , C r h . araCter which is Primarily intended for turning off, stopping or interrupting an 
ancillary device, f it is not required for this purpose, it may be used for anv other device P control 
function not provided by other DCs. ' comroi 


Examples of use of the device controls : 


i) 

One switching 

on - DC2 

off - 

DC4 

2) 

Two independent 

switchings 




First one 

Second one 

on - DC2 
on - DCI 

off - 
off - 

DC4 

DC3 

3) 

Two dependent switchings 




General 

Particular 

on - DC2 
on — DCI 

off - 
off - 

DC4 

DC3 

4) 

Input and output 

switching 




Output 

Input 

on - DC2 
on - DCI 

off - 
off - 

DC4 

DC3 


DEL — Delete 


A character used primarily to erase or obliterate an erroneous or unwanted character in punched tape 
DEL characters may also serve to accomplish media-fill or time-fill. They may be inserted into or 
removed from a stream of data without affecting the information content of that stream, but then the 
equipment^ rem ° Va ° f these characters may affect 'he information layout and/or the control of 
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DLE 


Data link escape 


A transmission control character which will change the meaning of a limited number of contiguously 

functions Onir^h- h‘ S ^ eX ^ lus,vely . t0 P rovide Supplementary data transmission control 
tunctions. Only graphic characters and transm.ssion control characters can be used in DLE sequences. 


EM 


End of medium 


A control character that may be used to identify the physical end of a me.dium, or the end of the used 
portion of a medium, or the end of the wanted portion of data recorded on a medium. The position 
ot this character does not necessarily correspond to the physical end of the medium. 


ENQ 


Enquiry 


transmission control character used as a request for a response from a remote station - the 
response may include station identification and/or station status. When a “Who are you?” function is 

re ? U kT e I T O' S u neral switched transmission network, the first use of ENQ after the connection is 
established shall have the meaning “Who are you?” (station identification). Subsequent use of ENO 
may, or may not, include the function "Who are you?”, as determined bv agreement 


EOT ~ End oj transmission 


A^transmission control character used to indicate the conclusion of the transmission of one or more 


ESC 


Escape 


A control character which is used to provide an additional control function. It alters the meaning of a 
imited number of contiguously following bit combinations which constitute the escape sequence. 

Escape sequences are used to obtain additional control functions which may provide among other 
things graphic sets outside the standard set. Such control functions must not be used as additional 
transmission controls. 

The use of the character ESC and of the escape sequences in conjunction with code extension 
techniques is the subject of an ISO Standard. 


ETB 


End of transmission block 


A transmission control character used to indicate the end of a transmission block of data where data 
is divided into such blocks for transmission purposes. 

ETX — End of text 

A transmission control character which terminates a text. 


FF 


Form feed 


A format effector which advances the active position to the same character position on a predeter- 
mined line of the next form or page. 


HT - Horizontal tabulation 


A format effector which advances the active position to the next predetermined character 
the same line. 


position on 
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Information separators 


IS, (US) - A control character used to separate and qualify data logically; its specific meaning has to be defined 

for each application. If this character is used in hierarchical order as specified in the general 

definition of IS, it delimits a data item called a UNIT. 

IS, (RS) - A control character used to separate and qualify data logically; its specific meaning has to be defined 

for each application. If this character is used in hierarchical order as specified in the general 

definition of IS, it delimits a data item called a RECORD. 

IS, (GS) - A control character used to separate and qualify data logically; its specific meaning has to be defined 

for each application. If this character is used in hierarchical order as specified in the general 

definition of IS, it delimits a data item called a GROUP. 

IS 4 (FS) - A control character used to separate and qualify data logically; its specific meaning has to be defined 

for each application. If this character is used in hierarchical order as specified in the general 

definition of IS, it delimits a data item called a FILE. 




— Linefeed 


A format effector which advances the active position to the same character position of the next line. 


Negative acknowledge 


A transmission control character transmitted by a receiver as a negative response to the sender. 


A control character used to accomplish media-fill or time-fill. NUL characters may be inserted into or 
removed from a stream of data without affecting the information content of that stream, but then the 
addition or removal of these characters may affect the information layout and/or the control of 
equipment. 


— Shift-in 


A control character which is used in conjunction with SHIFT-OUT and ESCAPE to extend the 
graphic character set of the code. It may reinstate the standard meanings of the bit combinations 
which follow it. The effect of this character when using code extension techniques is described in an 
ISO Standard. 


— Shift-out 


A control character which is used in conjunction with SHIFT-IN and ESCAPE to extend the graphic 
character set of the code. It may alter the meaning of the bit combinations of columns 2 to 7 which 
follow it until a SHIFT-IN character is reached. However, the characters SPACE (2/0) and DELETE 
(7/15) are unaffected by SHIFT-OUT. The effect of this character when using code extension 
techniques is described in an ISO Standard. 


— Start of heading 


A transmission control character used as the first character of a heading of an information message. 
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SP 


Space 


STX 


SUB 


SYN 


A character which advances the active position one character position on the same line. This character 
is also regarded as a non-printing graphic. 

Star t of text 

A transmission control character which precedes a text and which is used to terminate a heading. 
Substitute character 

A control character used in the place of a character that has been found to be invalid or in error. 
SUB is intended to be introduced by automatic means. 

Synchronous idle 

A transmission control character used by a synchronous transmission system in the absence of any 
other character (idle condition) to provide a signal from which synchronism may be achieved or 
retained between data terminal equipment. 

Vertical tabulation 


CM 


A format effector which advances the active position to the same character position on the next 
predetermined line. 

Reference 

[1] CCITT Recommendation General structure of signals of International Alphabet No. 5 code for data 
transmission over public data networks. Vol. VIII, Fascicle VIII. 2, Rec. X.4. 


Recommendation V.4 


GENERAL STRUCTURE OF SIGNALS OF INTERNATIONAL ALPHABET No. 5 CODE 
FOR DATA TRANSMISSION OVER PUBLIC TELEPHONE NETWORKS 11 

(Mar del Plata. 1968 : amended at Geneva. 1976 and 1980) 


The CCITT, 

I . considering, firstly. 

the agreement between the International Organization for Standardization (ISO) and the CCITT on the 
main characteristics of a seven-unit alphabet (International Alphabet No. 5) to be used for data transmission and 
for telecommunications requirements that cannot be met by the existing five-unit International Alphabet No. 2; 

the interest, both to the users and to the telecommunication services, of an agreement concerning the 
chronological order of transmission of bijs in serial working; 

declares the view 

that the agreed rank number of the unit in the alphabetical table of combinations should correspond to the 
chronological order of transmission in serial working on telecommunication circuits; 


See Recommendation X.4 [1] for data transmission over public data networks. 
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that, when this rank in the combination represents the order of the bit in binary numbering, the bits 
should be transmitted in serial working with the low order bit first; 

that the numerical meaning corresponding to each information unit considered in isolation is that of the 

digit: 

0 for a unit corresponding to condition A (travail = space), and 

1 for a unit corresponding to condition Z (repos = mark), 

in accordance with the definitions of these conditions for a two-condition transmission system; 

II. considering, moreover, 

that it is often desirable, in data transmission, to add an extra "parity" unit to allow for the detection of 
errors in received signals; 

the possibility offered by this addition for the detection of faults in data terminal equipment; 

the need to reserve the possibility of making this addition during the transmission itself, after the seven 
information units proper have been sent; 


declares the view 

that signals of the International Alphabet No. 5 code for data transmission should, in general, include an 
additional "parity” unit; 

that the rank of this unit and. hence, the chronological order of the transmission in serial working should 
be the eighth of the combination thus completed; 


III. considering 

that, in start-stop systems working with electromechanical equipment, the margin of such equipment and 
the reliability of the connection are considerably increased by the use of a stop element corresponding to the ^ 

duration of two unit intervals of the modulation; cn 

that for transmissions over telephone circuits via modems installed on the user's premises, the latter must 
be able to use the connections at the highest possible practical rate in characters per second, and that in such a 
case a single-unit stop element leads to a gain of about 10% as regards this practical rate; 

that, however, it does not appear that the production of electronic devices capable of working at will with 
start-stop signals having a stop element equal to one or two unit intervals should lead to costly complications and 
that such an arrangement can have the advantage of appreciably limiting the error rate without greatly reducing 
the practical efficiency of the connection; 


declares the view 


that in start-stop systems using combinations of the seven-unit alphabet normally followed by a parity 
unit, the first information unit of the transmitted combination should be preceded by a start element corres- 
ponding to condition A (space); 

that the duration of this start element should be a one-unit interval for the modulation rate under 
consideration, at transmitter output; 

that the combination of seven information units, normally completed by its parity unit, should be followed 
by a stop element corresponding to condition Z (mark); 

that for start-stop systems using the seven-unit code on switched telephone networks, a two-unit stop 
element should be used with electromechanical data terminal equipments operating at modulation rates up to and 
including 200 bauds. In other cases, the use of a one-unit stop element is preferable. However, this is subject to a 
mutual agreement between Administrations concerned; 

that similar situations when a one-unit stop element can be used may apply to leased circuits; 

that the start-stop receivers should be capable of correctly receiving start-stop signals comprising a 
single-unit stop element, whose duration will be reduced by a time interval equal to the deviation corresponding 
to the degree of gross start-stop distortion permitted at receiver input. However, for electromechanical equipment 
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CD 

CNJ 


which must use a two-unit stop element (eleven-unit code signal) with a modulation rate of 200 bauds or less 
receivers should be capable of correctly receiving signals with a stop element reduced to one unit; 

I V. considering, finally, 

that the direction of the parity unit can only be that of the even parity on the perforated tapes oarticularlv 
owing to the possibility of deletion (combination 7/15 of the alphabet) which causes a hole to appear irfall tracks! 

that, on the other hand, the odd parity is considered essential in the equipment which depends on 
ransitions in the signals to maintain synchronism [in cases where combination 1/6 (SYNC) of the alphabet does 
not permit of an economical solution); 1 ’ alphabet does 

declares the view 

<h' principle of " ‘ h ' in oper.red on 

that this parity should be odd on links or connections using end-to-end synchronous operation; 

that arrangements should be made when necessary to reverse the direction of the parity unit at the input 
and output of the synchronous equipment connected either to apparatus working on thestart-stop principled 
receiving characters on perforated tape; P pnnci P le or 

that the detection of a character out-of-parity may be represented by: 

^ KnToa, q r i0n P f aph ' C character or a representation of the capital letters SB (see 

2 ° 4 , 7 ‘ 2 * J ) provlded that these letters 0CCU P- V a single character position on the screen or printer 
and could have been entered by a single key stroke, recognizing it may be difficult to achieve a legible 
bb character from some matrix printers or displays where the characters are printed- or 


b) 


a recording of the 1/10 (SUB) character on the tape or other storage medium, where provided 


and that, where a SUB character occurs in a received transmission, or is presented to a DTE via a storage 
medium, e.g. paper tape, then the reaction should be as in a) and b) above. 8 


[1] 


References 

CCITT Recommendation General structure of signals of International Alphabet No. 5 code for data 
transmission over public data networks, Vol. VIII, Fascicle VI 1 1.2, Rec. X.4. J 

ISO Standard^MT”^] re P resen,a,ions f or the coMr ° l characters of the 7-bit coded character set. 


Recommendation V.5 


STANDARDIZATION OF DATA SIGNALLING RATES FOR 
SYNCHRONOUS DATA TRANSMISSION IN THE GENERAL SWITCHED TELEPHONE NETWORK 

(former Recommendation V.22. Geneva, 1964 ; 
amended at Mar del Plata, 1968. at Geneva, 1972 and 1976) 


1 Data transmission by international communications carried on the general switched telephone network 
using a synchronous transmission procedure will be done with a specific mode of modulation, two- or 
multi-condition, and sena transmission (see Note 1). For synchronous data transmission on leased telephone-type 
circuits see Recommendation V.6. p e l y pe 


2 The data signalling rates for synchronous transmission in the general switched telephone network will be: 

600, 1200, 2400 or 4800 bits/s (see Note 2). 

connection Ch °° Se am ° ng rat£S ‘ accordance with their needs and the facilities afforded by the 
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